/ 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 5/04, 15/00, 15/09, 15/11, 15/29, 
15/63, 15/74, 15/81, 15/82, A01H 5/00 



Al 



(11) International Poblkation Number: 
(43) International Publication Date: 



WO 00/18886 

6 April 2000 (06.04.00) 



(21) Internationa] Application Number: PCTAJS99Y22363 

(22) International Filing Date: 24 September 1999 (24.09.99) 



(30) Priority Data: 
60/101314 



25 September 1998 (25.09.98) US 



(71) Applicant (for all designated States except US): WISCONSIN 

ALUMNI RESEARCH FOUNDATION [US/US]; 614 Wal- 
nut Street, Madison, WI 53705 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): SPALDING, Edgar, P. 
[US/US]; 7417 South Avenue, Middleton, WI 53562 (US). 
NOH, Bosl [US/US]; 308E Eagle Heights, Madison, WI 
53705 (US). 

(74) Agents: KLANN, Ellen, M. et al.; Darm, Dorfman, Herrell and 
Skillman, Suite 720, 1601 Market Street, Philadelphia, PA 
19103 (US). 



(81) Designated Slates: AE, AL, AM, AT, AU, AZ, B A. BB, BG, 
BR, BY, CA, CH, CN, CR. CU, CZ, DE, DK, DM, EE, 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KE, KG, KP, KR, KZ, LC. LK, LR, LS, LT, LU, LV, MD, 
MG, MK, MN, MW, MX, NO. NZ, PL, PT, RO, RU, SD, 
SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US, 
UZ, VN. YU, ZA, ZW, ARIPO patent (GH, GM, KE, LS, 
MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, 
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL. PT, SE), OAPI patent (BF, BJ, CF, CG, a, CM, GA, 
GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments* 



(54) Title: XENOBIOTIC DETOXIFICATION GENE FROM PLANTS 
t57) Abstract 

A novel plant gene is provided, which is a member of the mdr family of genes encoding ABC transporters. The gene is inducible by 
NPPB, and is preferentially expressed in roots upon induction. The gene is useful for detoxification of certain xenobiotics to protect plants 
from the detrimental effects of such compounds. Also provided are plants that over-express and under-express this mdr gene. 



BNSDOCI0t<WO 0018886A1 I > 



FOR THE PURPOSES OP INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



Air 


Albania 


ES 


Spain 


LS 




SI 


Slovenia 


AM 


Armenia 


n 


TTi.i 1... rl 


LT 




SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Latvia bCCIS 


SN 


Senega] 


AU 


Australia 


GA 


Gabon 


LV 




SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GB 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BB 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Gieecc 




Repubfic 


TR 


Turkey 


BG 


Bulgaria - 


HU 


Hungary 


ML 




TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


1L 


Israel 


MR 


Masritaxua 


UG 


Uganda 


BV 


Belarus 


IS 




MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 




uz 


Uzbekistan 


CF 




JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


M itli ■ 1 t 

rterocnaiaji 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


a 


Cote d'rvooe 


KP 


Democratic People's 


NZ 


New Zealand 






CM 






Republic of Korea 


PL 


-Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






cu 


Cuba 


KZ 


Kazakhstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 




SD 


Sudan 






DK 

EE 


Denmark 
Estonia 


LK 
LR 


Sri Lanka 
Liberia 


SB 
SG 


Sweden 
Singapore 







BNSOOCID: <WO 0018888A1 I > 



WO 00/18886 PCT/US99/22363 



XENOBIOTIC DETOXIFICATION GENE FROM PLANTS 

This application claims priority to U.S. 
60/101,814, filed September 25, 1998, the entirety of 
which is incorporated by reference herein. 



5 Pursuant to 35 U.S.C. §202 (c), it is 

acknowledged that the U.S. Government has certain rights 
in the invention described herein, which was made in part 
with funds from the National Science Foundation, Grant 
No. IBN-9416016. 

10 

FIELD OF THE INVENTION 

This invention relates to the field of stress 
resistance in plants. In particular, the invention 
provides a novel gene from plants, which encodes an MDR- 
15 like ABC transporter, involved in detoxification of 
certain xenobiotics to protect plants from their 
detrimental effects. 

BACKGROUND OF THE INVENTION 

20 Several publications are referenced in this 

application to describe the state of the art to which the 
invention pertains. Each of these publications is 
incorporated by reference herein. 

Environmental stress is one of the most 

25 important limitations oh plant productivity, growth and 
survival. An ever-increasing source of environmental 
stress to plants is the stress caused by environmental 
pollutants in the soil, water and atmosphere. Such 
pollutants include herbicides, pesticides and related 

30 agronomic products, as well as organic and inorganic 

waste material from industry and other sources. Other 
toxic agents that threaten the survival of plants include 
various toxins produced by ephiphytic or soilborne 
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microorganisms/ such as fungi and bacteria. 

In order to survive in toxic environments, 
plants must have mechanisms to detoxify xenobiotics, 
heavy metals and other toxic compounds. This generally 
5 involves modification of the toxic compound and 

subsequent excretion into the vacuole or apoplastic 
space. Recently, certain ATP-binding cassette (ABC) 
transporters have been identified in plants, which appear 
to be involved in the detoxification process. 

10 The ABC transporter family is very large, with 

representatives existing in many different classes of 
organisms. Two well studied groups of ABC transporters, 
encoded by mdr and mrp genes, respectively, are 
associated with the multi-drug resistance phenomenon 

15 observed in mammalian tumor cells. The mdr genes encode 
a family of P-glycoproteins that mediate the energy- 
dependent efflux of certain lipophilic drugs from cells. 
The mrp genes encode a family of transporters that 
mediate the extrusion of a variety of organic compounds 

20 after their conjugation with glutathione. YCF1, the 
yeast homolog of mrp, encodes a protein capable of 
glutathione-mediated detoxification of heavy metals. 

Homologs of mrp and mdr genes have been 
identified in plant species. In AraJbidopsis thaliana, 

25 the glut athione-con jugate transporter encoded by the mrp 
homolog is located in the vacuolar membrane and is 
responsible for sequestration of xenobiotics in the 
central vacuole (Tommasini et al., FEBS Lett. 411; 206- 
210, 1997; Li et al., Plant Physiol. 107: 1257-1268, 

30 1995) . An mdr-like gene (atpgpl) has also been 

identified in A. thaliana, which encodes a putative P- 
glycoprotein homolog. The atpgpl gene was found to share 
significant sequence homology and structural organization 
with human mdr genes, and was expressed with particular 
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abundance in inflorescence axes (Dudler & Hertig, J. 
Biol. Chem. 267: 5882-5888, 1992). Other MDR homologs 
have been found in potato (Wang et al., Plant Mol. Biol. 
31: 683, 1996) and barley (Davies et al., Gene 199: 195, 
5 1997). 

The aforementioned mrp and mdr plant homologs 
were identified as a result of an effort to understand 
the molecular basis for development in plants of cross- 
resistance to herbicides of unrelated classes. However, 

10 these transporters are likely to serve the more general 
role in plants of sequestering, secreting, or otherwise 
detoxifying various organic and inorganic xenobiotics. 
Accordingly, it will constitute an advance in the art of 
plant genetic engineering of stress tolerance to identify 

15 and characterize other members of this class of 
transporters in plants. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, a new 
20 plant mdr hpmolog has been identified. Unlike the 

previously identified plant mdr homologs* this new gene 
is inducible by a class of compounds known to inhibit 
chloride ion channels . 

According to one aspect of the invention, a 
25 nucleic acid isolated from a plant is provided, which 

encodes a p-glycoprotein that is inducible by exposure of 
the plant to NPPB. The isolated nucleic acid is 
preferentially expressed in plant roots upon exposure of 
the plant to NPPB. In a preif erred embodiment, the plant 
30 from which the nucleic acid is isolated is selected from 
the group consisting of Brassica napus and Arabidopsis 
thaliana and is 3850-4150 nucleotides in length. In a 
more preferred embodiment, the nucleic acid has the 
restriction sites shown in Figure 4 for at least three 
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restriction enzymes. In particularly preferred 
embodiments , the nucleic acid molecule encodes a 
polypeptide having SEQ ID NO: 2. In an exemplary 
embodiment, the nucleic acid is a cDNA comprising the 
coding region of SEQ ID NO:l or SEQ ID NO: 10. 

According to another aspect of the invention is 
an expression cassette that comprises a plPAC gene 
operably linked to a promoter, and in a more preferred 
embodiment the plPAC gene is from Arabidopsis. In 
preferred embodiments, the expression cassette comprises 
the cauliflower mosaic virus 35S promoter, and part of 
all of SEQ ID NO:l or SEQ ID NO: 10. Further included in 
this aspect is a vector comprising the expression 
cassette and a method for producing transgenic plants 
with the expression cassette and vector. 

Another aspect of the invention are transgenic 
cells and plants containing the nucleic acids of the 
invention. In one preferred embodiment, the nucleic 
acids are be in the aforementioned expression cassette. 
Furhter included in this aspect are reporductive units 
from the transgenic plant. 

According to another aspect of the invention, 
an isolated nucleic acid molecule is provided, which has 
a sequence selected from the group consisting of: a) SEQ 
ID N0:1 and SEQ ID NO: 10; b) a nucleic acid sequence 
that is at least about 60% homologous to the coding 
regions of SEQ ID NO:l or SEQ ID NO: 10; c) a sequence 
hybridizing with SEQ ID NO:l or SEQ ID NO: 10 at moderate 
stringency; d) a sequence encoding part or all of a 
polypeptide having SEQ ID NO: 2; e) a sequence encoding an 
amino acid sequence that is at least about 70% identical 
to SEQ ID NO:2; f) a sequence encoding an amino acid 
sequence that is at least about 80% similar to SEQ ID 
NO: 2; g) a sequence encoding an amino acid sequence that 
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is at least about 40% similar to residues 1-76/ 613-669 
or 1144-1161 of .SEQ ID N0:2; and h) a sequence 
hybridizing at moderate stringency to a sequence encoding 
residues 1-76, 613-669 or 1144-1161 of SEQ ID N0:2. A 
5 polypeptide. produced by expression of the above listed 
sequences is also provided. 

According to another aspect of the invention, 
an isolated plant p-glycoprotein, which is inducible upon 
exposure of the plant to NPPB, is provided. The 

10 polypeptide preferably confers upon a cell in which it is 
found resistance to Rhodamine 6G. The polypeptide is 
preferentially produced in roots upon the exposure to the 
NPPB. The polypeptide is preferrably from Brassica napus 
or Arabidopsis thaliana. In most preferred embodiments, 

15 the polypeptide has a sequence that is a) an amino acid 
sequence that is at least 80% similar to SEQ ID NO: 2; b) 
an amino acid sequence that is at least 70% identical to 
SEQ ID NO: 2; c) an amino acid sequence that is at least 
40% similar to residues 1-76, 613-669 or 1144-1161 of SEQ 

20 ID NO: 2; and d) an amino acid sequence encoded by a 

nucleic acid sequence hybridizing at moderate stringency 
to a amino acid sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID N0:2. 

According to other aspects of the invention, 
-J-. 25 antibodies immunologically specific for the polypeptides 
of the invention are provided, that immunologically 
specific to any of the polypeptides, of polypeptide 
encoded by the nucleic acids of the invention. In a 
preferred embodiment, the antibody is immunospecif ic to 

30 residues 1-76, 613-669 or 1144-1161 of SEQ ID NO: 2. 

According to another aspect of the invention, a 
plant p-glycoprotein gene promoter, which is inducible by 
NPPB, is also provided. In a preferred embodiment/ the 
promoter is part or all of residues 1-3429 of SEQ ID 
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N0:10. 

According to another aspect of the invention, 
plants that have reduces levels of plPAC protein are 
provided. In a preferred embodiment, these plants have 
5 mutations in the plPAC gene, and in a particularly 

preferred embodiment, the plPAC gene is mutated due to 
the insertion of a T-DNA. Also provided with this aspect, 
is a method for selecting plants with mutations in plPAC 
using SEQ ID NOS: 11-14 as PCR primers. 
10 These and other features and advantages of the 

present invention will be described in greater detail in 
the description and examples set forth below. 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 Figure 1. Amino acid sequence lineup of ATPAC 

deduced amino acid sequence and the amino acid sequences 

of related mammalian and plant genes. The lineup shows 

the ATPAC deduced amino acid sequence (SEQ ID NO: 2) 

compared with (1) hmdrl (SEQ ID NO:3); (2) mmdrl (SEQ ID 
20 NO: 4); (3) hmdr3 (SEQ ID NO:5); (4) mmdr2 (SEQ ID NO:6); 

(5) atpgpl (SEQ ID NO:7); and (6) atpgp2 (SEQ ID NO:8). 

A consensus sequence (SEQ ID NO: 9) is also shown. 

Figure 2. Graph depicting the effect of 

rhodamine 6G on the growth rate of cells transformed with 
25 and expressing ATPAC as compared with control cells not 

containing ATPAC. 

Figure 3. Restriction map of genomic clone of 

ATPAC, SEQ ID NO: 10. 

Figure 4 . Restriction map of cDNA clone of 
30 ATPAC, SEQ ID NO:l. 

DETAILED DESCRIPTION OF THE INVENTION 
I • Definitions 

Various terms relating to the biological 
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molecules of the present invention are used hereinabove 
and also throughout the specification and claims. 

With reference to nucleic acids of the 
invention, the term "isolated nucleic acid" is sometimes 
used. This term, when applied to DNA, refers to a DNA 
molecule that is separated from sequences with which it 
is immediately contiguous (in the 5 f and 3' directions) 
in the naturally occurring genome of the organism from 
which it was derived. For example, the "isolated nucleic 
acid" may comprise a DNA molecule inserted into a vector, 
such as a plasmid or virus vector, or integrated into the 
genomic DNA of a procaryote or eucaryote. An "isolated 
nucleic acid molecule" may also comprise a cDNA molecule. 

With respect to RNA molecules of the invention 
the term "isolated nucleic acid" primarily refers to an 
RNA molecule encoded by an isolated DNA molecule as 
defined above. Alternatively, the term may refer to an 
RNA molecule that has been sufficiently separated from 
RNA molecules with which it would be associated in its 
natural state (i.e., in cells or tissues), such that it 
exists in a "substantially pure" form (the term 
"substantially pure" is defined below) . 

Nucleic acid sequences and amino acid sequences 
can be compared using computer programs that align the 
similar sequences of the nucleic or amino acids thus 
define the differences. For purposes of this invention, 
the DNAStar program (DNAStar, Inc., Madison, Wisconsin) 
and the default parameters used by that program are the 
parameters intended to be used herein to compare sequence 
identity and similarity. Alternately, the Blastn and 
Blastp 2.0 programs provided by the National Center for 
Biotechnology Information (at 

http://www.ncbi.nlm.nih.gov/blast/ ; Altschul et al., 
1990, J Mol Biol 215; 403-410) using a gapped alignment 
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with default parameters, may, be used to determine the 
level of identity and similarity between nucleic acid 
sequences and amino acid sequences. 

The term "substantially the same" refers to 
nucleic acid or amino acid sequences having sequence 
variation that do not materially affect the nature of the 
protein (i.e. the structure, thermostability 
characteristics and/or biological activity of the 
protein) . With particular reference to nucleic acid 
sequences, the term "substantially the same" is intended 
to refer to the coding region and to conserved sequences 
governing expression, and refers primarily to degenerate 
codons encoding the same amino acid, or alternate codons 
encoding conservative substitute amino acids in the 
encoded polypeptide. With reference to amino acid 
sequences, the term "substantially the same" refers 
generally to conservative substitutions and/or variations 
in regions of the polypeptide not involved in 
determination of structure or function. 

The terms "percent identical" and "percent 
similar" are also used herein in comparisons among amino 
acid and nucleic acid sequences. When referring to amino 
acid sequences, ^percent identical" refers to the percent 
of the amino acids of the subject amino acid sequence 
that have been matched to identical amino acids in the 
compared amino acid sequence by a sequence analysis 
program. * Percent similar" refers to the percent of the 
amino acids of the subject amino acid sequence that have 
been matched to identical or conserved amino acids . 
Conserved amino acids are those which differ in structure 
but are similar in physical properties such that the 
exchange of one for another would not appreciably change 
the tertiary structure of the resulting protein. 
Conservative substitutions are defined in Taylor (1986, 
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J. Theor. Biol. 119:205). When referring to nucleic acid 
molecules, ^percent identical" refers to the percent of 
the nucleotides of the subject nucleic acid sequence that 
have been matched. to identical nucleotides by a sequence 
analysis program. 

With respect to protein, the term "isolated 
protein" or "isolated and purified protein" is sometimes 
used herein. This term refers primarily to a protein 
produced by expression of an isolated nucleic acid 
molecule of the invention. Alternatively, this term may 
refer to a protein which has been sufficiently separated 
from other proteins with which it would naturally be 
associated, so as to exist in "substantially pure" form. 

The term "substantially pure" refers to a 
preparation comprising at least 50-60% by weight the 
compound of interest (e.g., nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 
Purity is measured by methods appropriate for the 
compound of interest (e.g. chromatographic methods, 
agarose or polyacrylamide gel electrophoresis, HPLC 
analysis, and the like) . 

With respect to antibodies of the invention, 
the term "immunologically specific" refers to antibodies 
that bind to one or more epitopes of a protein of 
interest, but which do not substantially recognize and 
bind other molecules in a sample containing a mixed 
population of antigenic biological molecules. 

With respect to oligonucleotides, the term 
"specifically hybridizing" refers to the association 
between two single-stranded nucleotide molecules of 
sufficiently complementary sequence to permit such 
hybridization under pre-determined conditions generally 
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used in the art (sometimes termed "substantially 
complementary") . In particular, the term refers to 
hybridization of an oligonucleotide with a substantially 
complementary sequence contained within a single-stranded 
DNA or RNA molecule of the invention, to the substantial 
exclusion of hybridization of the oligonucleotide with 
single-stranded nucleic acids of non-complementary 
sequence. 

The term m expression cassette", as used herein, 
comprises 5 f and 3 1 regulatory regions operably linked to 
a coding sequence. The coding sequence may be in the 
sense or antisense orientation with respect to the 5 f 
regulatory region. 

The term "promoter region" refers to the 5' 
regulatory regions of a gene. 

The term * reporter gene" refers to genetic 
sequences which may be operably linked to a promoter 
region forming a transgene, such that expression of the 
reporter gene coding region is regulated by the promoter 
and expression of the transgene is readily assayed. 

The term "selectable marker gene" refers to a 
gene product that when expressed confers a selectable 
phenotype, such as antibiotic resistance, on a 
transformed cell or plant. 

' The term "operably linked" means that the 
regulatory sequences necessary for expression. of the 
coding sequence are placed in the DNA molecule in the 
appropriate positions relative to the coding sequence so 
as to effect expression of the coding sequence. This 
same definition is sometimes applied to the arrangement 
of coding sequences and transcription control elements 
(e.g. promoters, enhancers, and termination elements) in 
an expression vector. 

The term "DNA construct" refers to genetic 
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sequence used to transform plants and generate progeny 
transgenic plants. These constructs may be administered 
to plants in a viral or plasmid vector. Other methods of 
delivery such as Agrobacterium T-DNA mediated 
5 transformation and transformation, using the biolistic 
process are also contemplated to be within the scope of 
the present invention. The transforming DNA may be 
prepared according to standard protocols such as those 
set forth in "Current Protocols in Molecular Biology", 
10 eds. Frederick M. Ausubel et al., John Wiley & Sons, 
1995. 

The term m xenobiotic" refers to foreign 
chemicals or agents not produced or naturally found in 
the organism. The term is commonly used in reference to 
15 toxic or otherwise detrimental foreign chemicals, such as 
organic pollutants or heavy metals. 



20 



II. Description of plPAC 

and its Encoded Pol vep tide 



In accordance with the present invention, a 
nucleic acid encoding a novel ATP-binding-cassette (ABC) 
transporter has been isolated and cloned from plants. 
The nucleic acid is referred to herein as plPAC. 
25 A cDNA clone of the plPAC from AraJbidopsis 

thaliana, an exemplary plPAC of the invention, is 
described in detail herein and its nucleotide sequence is 
set forth in Example 1 as SEQ ID N0:1. This nucleic acid 
molecule is referred to as m ATPAC . It is 36% identical 
30 and 51% similar to human mdrl across the entire sequence. 
It is 51% identical to the atpgpl gene reported by Dudler 
& Hertig (1997, supra) and 50% identical to atpgp2, a 
close homolog of atpgpl, published in the Genbank . 
database. ATPAC protein is 65% similar to atpgpl and 
35 atpgp2 proteins. 
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A partial clone of a plPAC of the invention was 
originally isolated from Brassica napiis via differential 
expression screening of plants grown in the presence or 
absence of the chloride channel blocker/ 5-nitro-2- (3- 
phenylpropylamino) benzoic acid (NPPB). A 0,5 kb gene 
fragment was identified, which had been up- regulated in 
response to NPPB treatment. This cDNA fragment was used . 
to screen an Arabidopsis cDNA library, from which the 
complete ATPAC clone was isolated. The isolation and 
characterization of ATPAC is described in Example 1. 

A genomic clone of ATPAC (SEQ ID NO: 10) has 
also been isolated from a bacterial artificial chromosome 
(BAC) library of the Arabidopsis genome (BAC clone IGF 
F3J22, obtained from the Arabidopsis stock center, Ohio 
State University) . A 7 kb fragment containing part of 
ATPAC and additional 5 f regulatory sequences was 
subcloned into a plasmid vector (pBluescript) . A 
restriction map of ATPAC is found in Fig. 3. The 
corresponding cDNA clone of ATPAC is found in SEQ ID NO:l 
and its restriction map is Fig. 4. 

Among the unique features of this nucleic acid 
molecule as compared with other mdr-like genes from 
plants are its inducibility by certain compounds, 
including NPPB and herbicides, and its preferential 
expression in roots. The promoter regulatory region of 
ATPAC comprises residues 1-3429 of SEQ ID N0:10. 

Although the ATPAC cDNA clone from Arabidopsis 
thaliana is described and exemplified herein, this 
invention is intended to encompass nucleic acid sequences 
and proteins from other plant species that are 
sufficiently similar to be used instead of ATPAC nucleic 
acid and proteins for the purposes described below. 
These include, but are not limited to, allelic variants 
and natural mutants of SEQ ID NO:l, which are likely to 
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be found in different species of plants or varieties of 
Arabidopsis. 

Because such variants are expected to possess 
certain differences in nucleotide and amino acid 
5 sequence, this invention provides an isolated plPAC 
nucleic acid molecule having at least about 60% 
(preferably 70% and more preferably over 80%) sequence 
homology in the coding regions with the nucleotide 
sequence set forth as SEQ ID NO:l or SEQ ID NO: 10 (and, 

10 most preferably, specifically comprising the coding 

region of SEQ ID NO:l). Also provided are nucleic acids 
that encode a polypeptide that is at least about 40% 
(preferably 50% and most preferably 60%) similar to 
residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. Also 

15 provided are nucleic acids that hybridize to the nucleic 
acids of SEQ ID N0:1, SEQ ID NO: 10, or nucleic acids 
encoding the regions of residues 1-76, 613-669 or 1144- 
1161 of SEQ ID NO: 2, preferably under moderate stringency 
(more preferably, high stringency, and most preferably, 

20 very high stringency) . 

In other preferred embodiments, the nucleic 
acids have a restriction digest map that is identical for 
at least 3 enzymes (more preferably 6 enzymes and most 
preferably .9 enzymes)' to the maps shown in Figs. 3 or 4. 

25 In another preferred embodiment, the nucleic acids have a 
restriction digest map identical to those shown in Fig. 3 
for enzymes Xhol, Xcml and Spel (preferably additionally 
Sad, Pad and Bsal, and most preferably additionally 
Ac! I, BanI and SnaBI) . In another preferred embodiment, 

30 the nucleic acids have a restriction digest map identical 
to those shown in Fig. 4 for enzymes Xbal, TatI and Neil 
(preferably additionally Dral, BsmI and Bell, and most 
preferably additionally AccI, Bsgl and Tlil) . The 
nucleic acids of the invention are at least 20 nucleic 
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acids in length (preferably at least 50 nucleic acids and 
most preferably at least 100 nucleic acids) . . 

In accordance with the invention, novel plPAC 
genes from two plant species, Brassica napus and 
5 Arabidopsis thaliana, are presented. This constitutes 
the first description of this unique p-glycoprotein in 
plants. Indeed, the closest known protein sequence, also, 
from Arabidopsis, is only 65% identical suggesting that 
the ATPAC gene is novel and is expected to have novel 

10 properties. The isolation of two plPAC genes from 

different species enables the isolation of further plPAC 
genes from other plant species. Isolated nucleic acids 
that are plPAC genes from any plant species are 
considered part of the instant invention. In particular, 

15 the nucleic acids of other piPAC genes can be isolated 
using sequences of ATPAC that distinguish plPAC genes 
from other plant mdr genes according to methods that are 
well known to those in the art of gene isolation. In 
particular, sequences that encode residues 1-7 6, 613-669 

20 and 1144-1161 of SEQ ID N0:2 can be used. In a preferred 
embodiment, the plPAC gene is from any higher plant 
species (more preferred from a dicot species, and most 
preferred from a species in Brassicaceae (or 
Cruciferae) ) . 

25 This invention also provides isolated 

polypeptide products of the open reading frames of SEQ ID 
NO:l or SEQ ID NO: 10, having at least about 70% 
(preferably 80% and most preferably 90%) sequence 
identity, or at least about 80% similarity (preferably 

30 90% and more preferably 95%) with the amino acid sequence 
of SEQ ID NO: 2. In another embodiment, the polypeptides 
of the invention are at least about 40% identical 
(preferably 50%, and most preferably 60%) to the regions 
of residues l-7£, 613-669 or 1144-1161 of SEQ ID NO: 2. 
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Because of the natural sequence variation likely to exist 
among plPAC genes, one skilled in the art would expect to 
find up to about 30-40% nucleotide sequence variation, 
while still maintaining the unique properties of the 
5 plPAC gene and encoded polypeptide of the present 

invention. Such an expectation is due in part to the 
degeneracy of the genetic code, as well as to the known 
evolutionary success of conservative amino acid sequence 
variations, which do not appreciably alter the nature of 

10 the encoded protein. Accordingly, such variants are 

considered substantially the same as one another and are 
included within the scope of the present invention. 

Also provided are transgenic plants transformed 
with part or all of the nucleic acids of the invention. 

15 Transgenic plants that over-express a plPAC coding 
sequence are one embodiment of this aspect of the 
invention. Example 3 provides for one prototype of such 
a plant. In a preferred embodiment, the ATPAC gene is 
used, and in a most preferred embodiment SEQ ID NO:l or 

20 SEQ ID NO: 10 is used. The plPAC gene may be placed under 
a powerful constitutive promoter, such as the Cauliflower 
Mosaic Virus (CaMV) 35S promoter or the figwort mosaic 
virus 35S promoter. In a preferred embodiment, the 
35SCaMV promoter is used. Transgenic plants expressing 

25 the plPAC gene under an inducible promoter (either its 
own promoter or a heterologous promoter) are also 
contemplated to be within the scope of the present 
invention. Inducible plant promoters include the 
tetracycline repressor/operator controlled promoter. In 

30 a preferred embodiment, a native plPAC promoter is used, 
and in a most preferred embodiment, residues 1-3429 of 
SEQ ID NO: 10 is used. Plant species that are 
contemplated for overexpression of a plPAC coding 
sequence include, but are not limited to, soybean. 
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In another embodiment, overexpression of plPAC 
is induced to generate a co-suppression effect. This 
excess expression serves to promote down-regulation of 
both endogenous and exogenous plPAC genes* 
5 In some instances, it may be desirable to down- 

regulate or inhibit expression of endogenous plPAC in 
plants possessing the gene. Accordingly, plPAC nucleic 
acid molecules, or fragments thereof, may also be 
utilized to control the production of plPAC-encoded P- 

10 glycoproteins. In one embodiment, full-length plPAC 
antisense molecules or antisense oligonucleotides, 
targeted to specific regions of pJPAC-encoded RNA that 
are critical for translation, are used. The use of 
antisense molecules to decrease expression levels of a 

15 pre-determined gene is known in the art. In a preferred 
embodiment, antisense molecules are provided in situ by 
transforming plant cells with a DNA construct which, upon 
transcription, produces the antisense sequences. Such 
constructs can be designed to produce full-length or 

20 partial antisense sequences. One example of antisense 
plPAC transgenic plants is given in Example 3. 

In another embodiment, knock-out plants are 
obtained by screening a T-DNA mutagenized plant 
population for insertions in the plPAC gene (see krysan 

25 et al., 1996, PNAS 93:8145). One example of this 
embodiment of the invention is found in Example 3. 
Optionally, transgenic plants can be created containing 
mutations in the region encoding the active site of 
plPAC. These last two embodiments are preferred over the 

30 use of anti-sense constructs due to the high homology 
among P-glycoproteins . 

The promoter of ATPAC is also provided in 
accordance with the invention. This promoter has the 
useful properties of root expression and inducability by 
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NPPB. The prototypic example of this aspect of the 
invention is residues 1-3429 of SEQ ID NO: 10. It is 
anticipated that plPAC genes from other plant species 
will likewise exhibit the aforementioned useful 
5 properties. As these promoter regions can easily be 

isolated from the plPAC genes that are provided with the 
invention, all plant plPAC gene promoters are provided 
with the invention. The nucleic acids of the invention 
therefore include a nucleic acid molecule that is at 

10 least about 70% identical (preferably 80% and most 

preferably 90%) to the residues 1-3429 of SEQ ID NO: 10. 
Also provided are nucleic acids that hybridize to the 
nucleic acid residues 1-3429 of SEQ ID NO: 10 preferably 
under moderate stringency (more preferably, high 

15 stringency, and most preferably, very high stringency) . 

The present invention also provides antibodies 
capable of immuno-specifically binding to polypeptides of 
the invention. Polyclonal or monoclonal antibodies 
directed toward any of the peptides encoded by plPAC may 

20 be prepared according to standard methods. Monoclonal 
antibodies may be prepared according to general methods 
of Kohler and Milstein, following standard protocols. In 
a preferred embodiment, antibodies are prepared, which 
react immuno-specifically with various epitopes of the 

25 piPAC-encoded polypeptides. In a preferred embodiment, 
the antibodies are immunologically specific to the 
polypeptide of residues 1-76, 613-669 or 1144-1161 of SEQ 
ID NO:2. 

The following description sets forth the 
30 general procedures involved in practicing the present 
invention. To the extent that specific materials are 
mentioned, it is merely for purposes of illustration and 
is not intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 
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forth in Sambrook et al., Molecular Cloning , Cold Spring 
Harbor Laboratory (1989) (hereinafter "Sambrook et al. n ) 
or Ausubel et al. (eds) Current Protocols in Molecular 
Biology , John Wiley & Sons (1997) (hereinafter m Ausubel 
5 et al.") are used. 



III. Preparation of P1PAC Nucleic Acid Molecules, 

encoded Polypeptides, Antibodies Specific for the 
Polypeptides and Transgenic Plants 



10 



1. Nucleic Acid Molecules 

P1PAC nucleic acid molecules of the invention 
may be prepared by two general methods: (1) they may be 

15 synthesized from appropriate nucleotide triphosphates, or 
(2) they may be isolated from biological sources. Both 
methods utilize protocols well known in the art. 

The availability of nucleotide sequence 
information/ such as the cDNA having SEQ ID NO:l, enables 

20 preparation of an isolated nucleic acid molecule of the 
invention by oligonucleotide synthesis ♦ Synthetic 
oligonucleotides may be prepared by the phosphoramadite 
method employed in the Applied Biosys terns 38A DNA 
Synthesizer or similar devices. The resultant construct 

25 may be purified according to methods known in the art, 
such as high performance liquid chromatography (HPLC) . 
Long, double-stranded polynucleotides, such as a DNA 
molecule of the present invention, must be synthesized in 
stages, due to the size limitations inherent in current 

30 oligonucleotide synthetic methods. Thus, for example, a 
long double-stranded molecule may be synthesized as 
several smaller segments of appropriate complementarity. 
Complementary segments thus produced may be. annealed such 
that each segment possesses appropriate cohesive termini 

35 for attachment of an adjacent segment. Adjacent segments 
may be ligated by annealing cohesive termini in the 
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presence of DNA ligase to construct an entire long 
double-stranded molecule. A synthetic DNA molecule so 
constructed may then be cloned and amplified in an 
appropriate vector. 

P1PAC genes also may be isolated from 
appropriate biological sources using methods known in the 
art. In fact, the ATPAC clone was isolated from an 
Arabidopsis cDNA library using a partial clone obtained 
from Brassica napus. In alternative embodiments, genomic 
clones of plPAC may be isolated. 

In accordance with the present invention, 
nucleic acids having the appropriate level sequence 
homology with part or all the coding regions of SEQ ID 
NO:l or SEQ ID NO: 10 may be identified by using 
hybridization and washing conditions of appropriate 
stringency. For example, hybridizations may be 
performed, according to the method of Sambrook et al., 
using a hybridization solution comprising: 5X SSC, 
5X Denhardt's reagent, 1.0% SDS, 100 pg/ml denatured, 
fragmented salmon sperm DNA, 0.05% sodium pyrophosphate 
and up to 50% formamide. -Hybridization is carried out at 
37-42°C for at least six hours. Following hybridization, 
filters are washed as follows: (1) 5 minutes at room 
temperature in 2X SSC and 1% SDS; (2) 15 minutes at room 
temperature in 2X SSC and 0.1% SDS; (3) 30 minutes-1 
hour at 37 °C in 2X SSC and 0.1% SDS; (4) 2 hours at 45- 
55°in 2X SSC and 0.1% SDS, changing the solution every 30 
minutes. 

One common formula for calculating the 
stringency conditions required to achieve hybridization 
between nucleic acid molecules of a specified sequence 
homology (Sambrook et al., 1989): 

T m = 81.5°C + 16.6Log [Na+] + 0.41(% G+C) - 0.63 (% fonnanride) - 600/#bp in duplex 
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As an illustration of the above formula, using [N+] = 
[0.368] and 50% formamide, with GC content of 42% and an 
average probe size of 200 bases, the T n is 57°C. The T B 
of a DNA duplex decreases by 1 - 1.5°C with every 1% 
5 decrease in homology. Thus, targets with greater than 
about 75% sequence identity would be observed using a 
hybridization temperature of 42 °C, 

The stringency of the hybridization and wash 
depend primarily on the salt concentration and 

10 temperature of the solutions. In general, to maximize 
the rate of annealing of the probe with its target, the 
hybridization is usually carried out at salt and 
temperature conditions that are 20-25 °C below the 
calculated T n of the of the hybrid. Wash conditions 

15 should be as stringent as possible for the degree of 

identity of the probe for the target. In general, wash 
conditions are selected to be approximately 12-20°C below 
the T m of the hybrid. In regards to the nucleic acids of 
the current invention, a moderate stringency 

20 hybridization is defined as hybridization in 6X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 ]ig/ml denatured 
salmon sperm DNA at 42°C, and wash in 2X SSC and 0.5% SDS 
at 55°C for 15 minutes. A high stringency hybridization 
is defined as hybridization in 6X SSC, 5X Denhardt's 

25 solution, 0.5% SDS and 100 jig/ml denatured salmon sperm * 
DNA at 42°C, and wash in IX SSC and 0.5% SDS at 65°C for 
15 minutes. A very high stringency hybridization is 
defined as hybridization in 6X SSC, 5X Denhardt's 
solution, 0.5% SDS and 100 pg/ml denatured salmon sperm 

30 DNA at 42°C, and wash in 0.1X SSC and 0.5% SDS at 65°C 
for 15 minutes. 

Nucleic acids of the present invention may be 
maintained as DNA in any convenient cloning vector. In a 
preferred embodiment, clones are maintained in plasmid 
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cloning/expression vector, such as pGEM-T (Promega 
Biotech, Madison, WI) or pBluescript (Stratagene, La 
Jolla, CA) , either of which is propagated in a suitable 
E. coii host cell. 

P1PAC nucleic acid molecules of the invention 
include cDNA, genomic DNA, RNA, and fragments thereof 
which may be single- or double-stranded. Thus, this 
invention provides oligonucleotides (sense or antisense 
strands of DNA or RNA) having sequences capable of 
hybridizing with at least one sequence of a nucleic acid 
molecule of the present invention, such as selected 
segments of SEQ ID N0:1 or SEQ ID NO: 10. Such 
oligonucleotides are useful as probes for detecting plPAC 
genes or mRNA in test samples, e.g. by PCR amplification, 
mapping of genes or for the positive or negative 
regulation of expression of plPAC genes at or before 
translation of the mRNA into proteins. 

The plPAC promoter is also expected to be 
useful in connection with the present invention, inasmuch 
as it is inducible in plants upon exposure to anion 
channel blockers. As mentioned above, seven-kilobase 
fragment of genomic DNA has been isolated, which contains 
part or all of the plPAC promoter from Arabidopsis 
thaliana. This promoter can be used in chimeric gene 
constructs to facilitate inducible expression of any 
coding sequence of interest, upon exposure to NPPB or 
similar-acting compounds. 

2. Proteins 

Polypeptides encoded by plPAC nucleic acids of 
the invention may be prepared in a variety of ways, 
according to known methods. If produced in situ the 
polypeptides may be purified from appropriate sources, 
e.g., plant roots or other plant parts. 
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Alternatively, the availability of nucleic acid 
molecules encoding the polypeptides enables production of 
the proteins using in vitro expression methods known in 
the art. For example, a cDNA or gene may be cloned into 
an appropriate in vitro transcription vector, such a 
pSP64 or pSP65 for in vitro transcription, followed by 
cell-free translation in a suitable cell-free translation . 
system, such as wheat germ or rabbit reticulocytes. In 
vitro transcription and translation systems are 
commercially available, e.g., from Promega Biotech, 
Madison, Wisconsin or BRL, Rockville, Maryland. 

According to a preferred embodiment/ larger 
quantities of plPAOencoded polypeptide may be produced 
by expression in a suitable procaryotic or eucaryotic 
system. For example, part or all of a DNA molecule, such 
as the cDNA having SEQ ID NO:l, may be inserted into a 
plasmid vector adapted for expression in a bacterial cell 
{such as E. coli) or a yeast cell (such as Saccharomyces 
cerevisiae) , or into a baculovirus vector for expression 
in an insect cell. Such vectors comprise the regulatory 
elements necessary for expression of the DNA in the host 
cell, positioned in such a manner as to permit expression 
of the DNA in the host cell. Such regulatory elements 
required for expression include promoter sequences, 
transcription initiation sequences and, optionally/ 
enhancer sequences. 

The plPAC polypeptide produced by gene 
expression in a recombinant procaryotic or eucyarotic 
system may be purified according to methods known in the 
art. In a preferred embodiment, . a commercially available 
expression/secretion system can be used, whereby the 
recombinant protein is expressed and thereafter secreted 
from the host cell, to be easily purified from the 
surrounding medium. If expression/secretion vectors are 
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not used, an alternative approach involves purifying the 
recombinant protein by affinity separation, such as by 
immunological interaction with antibodies that bind 
specifically to the recombinant protein. Such methods 
5 are commonly used by skilled practitioners. 

The plPAC-encoded polypeptides of the 
invention, prepared by the aforementioned methods, may be 
analyzed according to standard procedures. 

10 C. Transgenic Plants 

Transgenic plants expressing the plPAC gene can 
be generated using standard plant transformation methods 
known to those skilled in the art. These include, but 
are not limited to, Agrobacterium vectors, PEG treatment 

15 of protoplasts, biolistic DNA delivery, UV laser 

microbeam, gemini virus vectors, calcium phosphate 
treatment of protoplasts, electroporation of isolated 
protoplasts, agitation of cell suspensions with 
microbeads coated with the transforming DNA, direct DNA 

20 uptake, liposome-mediated DNA uptake, and the like. Such 
methods have been published in the art. See, e.g., 
Methods for Plant Molecular Biology (Weissbach & 
Weissbach, eds., 19881: Methods in Plant Molecular 
Biology (Schuler & Zielinski, eds., 1989); Plant 

25 Molecular Biology Manual (Gelvin, Schilperoort, Verma, 
eds., 1993); *nH MPthnris in Plant Molecular Biology - A 
Lahoratorv Manual (Maliga, Klessig, Cashmore, Gruissem & 
Varner, eds., 1994) . 

The method of transformation depends upon the 

30 plant to be transformed. The biolistic DNA delivery 

method is useful for nuclear transformation. In another 
embodiment of the invention, Agrobacterium vectors are 
used to advantage for efficient transformation of plant 
nuclei. 
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In a preferred embodiment, the gene is 
introduced into plant nuclei in Agrobacterium binary 
. vectors. Such vectors include, but are not limited to, 
BIN19 (Bevan, 1984, Nucleic Acid Res 12: 8711-8721) and 
5 derivatives thereof, the pBI vector series (Jefferson et 
al., 1987, PNAS 83:8447-51), and binary vectors pGA482 
and pGA492 (An, 1986) and others (for review, see An, 
1995, Methods Mol Biol 44:47-58) . In preferred 
embodiments, the pPZP211 vector (Hajdukiewicz et al., 

10 1994, PMB 25:989-994) or PCGN7366 (Calgene, CA) are used. 
DNA constructs for transforming a selected plant comprise 
a coding sequence of interest operably linked to 
appropriate 5 1 (e.g., promoters and translational 
regulatory sequences) and 3' regulatory sequences (e.g., 

15 _ terminators). 

Using an Agrobacterium binary vector system for 
transformation, the plPAC coding region, under control of 
a constitutive or inducible promoter as described above, 
is linked to a nuclear drug resistance marker, such as 

20 kanamycin resistance. Agrobacteriuro-mediated 

transformation of plant nuclei is accomplished according 
to the following procedure: 

(1) the gene is inserted into the selected 
Agrobacterium binary vector; 

25 (2) transformation is accomplished by co- 

cultivation of plant tissue (e.g., leaf discs) with a 
suspension of recombinant Agrobacterium, followed by 
incubation (e.g., two days) on growth medium in the 
absence of the drug used as the selective medium (see, 

30 e.g., Horsch et al. 1985, Cold Spring Harb Symp Quant 
Biol. 50:433-7); 

(3) plant tissue is then transferred onto the 
selective medium to identify transformed tissue; and 

(4) identified transf ormants are regenerated 
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to intact plants. 

It should be recognized that the amount of 
expression, as well as the tissue specificity of 
expression of the plPAC gene in transformed plants can 
vary depending on the position of their insertion into 
the nuclear genome. Such position effects are well known 
in the art. For this reason, several nuclear 
trans formants should be regenerated and tested for 
expression of the transgene. 

IV. Uses of PlPAC Nucleic Acids, Encoded Proteins 
and Antibodies 

1. PlPAC Nucleic Acids 

15 PlPAC nucleic acids may be used for a variety 

of purposes in accordance with the present invention. 
The DNA, RNA, or fragments thereof may be used as probes 
to detect the presence of and/or expression of plPAC 
genes. Methods in which plPAC nucleic acids may be 

20 utilized as probes for such assays include, but are not 
limited to: (1) in situ hybridization; {2} Southern 
hybridization (3) northern hybridization; and (4) 
assorted amplification reactions such as polymerase chain 
reactions (PCR) • 

25 The plPAC nucleic acids of the invention may 

also be utilized as probes to identify related genes from 
other plant species. As is well known in the art and 
described above, hybridization stringencies may be 
adjusted to allow hybridization of nucleic acid probes 

30 with complementary sequences of varying degrees of 
homology. Thus, plPAC nucleic acids may be used to 
advantage to identify and characterize other genes of 
varying degrees of relation to the exemplary ATPAC, 
thereby enabling further characterization of this family 
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of genes in plants. Additionally/ they may be used to 
identify genes encoding proteins that interact with the 
P-glycoprotein encoded by plPAC (e.g., by the 
"interaction trap" technique) . 

2. PlPAC Proteins and Antibodies 

Purified pi PAC- encoded P-glycoproteins, or 
fragments thereof, may be used to produce polyclonal or 
monoclonal antibodies which also may serve as sensitive 
detection reagents for the presence and accumulation of 
plant P~ glycoproteins in cultured plant cells or tissues 
and in intact plants. Recombinant techniques enable 
expression of fusion proteins containing part or all of 
the plPAC-encoded protein. The full length protein or 
fragments of the protein may be used to advantage to 
generate an array of monoclonal or polyclonal antibodies 
specific for various epitopes of the protein, thereby 
providing even greater sensitivity for detection of the 
protein in cells or tissue. 

Polyclonal or monoclonal antibodies 
immunologically specific for plPAOencoded proteins may 
be used in a variety of assays designed to detect and 
quantitate the protein. Such assays include, but are not 
limited to: (1) flow cytometric analysis; (2) 
immunochemical localization in cultured cells or tissues; 
and (3) immunoblot analysis (e.g., dot blot, Western 
blot) of extracts from various cells and tissues. 

Polyclonal or monoclonal antibodies that 
immunospecif ically interact with one or more of the 
polypeptides encoded by plPAC can be utilized for 
identifying and purifying such proteins. For example, 
antibodies may be utilized for affinity separation of 
proteins with which they immunospecif ically interact. 
Antibodies may also be used to immunoprecipitate proteins 
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from a sample containing a mixture of proteins and other 
biological molecules. 

3. pIPAC Transgenic Plants 
5 Transgenic plants that over- or under- express 

pIPAC can be used in a varied of agronomic and research 
applications. From the foregoing discussion, it can be 
seen that pIPAC and its homo logs, and transgenic plants 
containing them will be useful for improving stress 

10 resistance or tolerance in plants. This provides an 

avenue for developing marginal or toxic soil environments 
for crop production. Both over- and under-expressing 
pIPAC transgenic plants have great utility in the 
research of herbicides and other xenobiotic compounds. 

15 As discussed above and in greater detail in 

Example 1, the similarity between plant and mammalian mdr 
genes indicates that their functional aspects will also 
be conserved. Thus, pIPAC is expected to play an 
important, role in the exclusion of toxic metabolic or 

20 xenobiotic compounds from cells. The fact that pIPAC 
also is inducible and appears to be preferentially 
expressed in roots, where contact with such compounds 
often occurs, makes pIPAC particularly desirable for 
genetic engineering of plants to increase their tolerance 

25 to such: compounds. Accordingly/ plants engineered to 

overexpress the pIPAC gene should be resistant to a wide 
range of chemicals, both intentionally applied as 
herbicides or unintentionally as wastes. Examples of the 
kinds of xenobiotics that should be detoxified by the 

30 pIPAC of the invention include, but are not. limited to, 
hydrophobic (i.e., lipophilic) herbicides and other 
compounds, such as 3 (3, 4-dichlorophenyl) -1, 1, dimethyl 
urea (also known as DCMU or Diuron, available from Sigma 
Chemical Co., St. Louis, MO) or other hydrophobic 
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compounds that disrupt photosynthetic electron transport, 
as well as Metachlor (Ciba Geigy, Basel Switzerland) , 
Taurocholate (Sigma Chemical Co. ) , Primisulfuron (Ciba 
Geigy) , and IRL-1803. 
5 As illustrated in Example 2, plant cells that 

over-express a pi PAC gene have surprisingly higher growth 
rate with or without the xenobiotic compound Rhodamine 
6G. It is contemplated that plPAC overexpression may be 
a generally useful way to increase plant and plant cell 
10 culture growth, even without the presence of xenobiotic 
compounds. 

The following specific examples are provided to 
illustrate embodiments of the invention. They are not 
15 intended to limit the scope of the invention in any way. 



EXAMPLE 1 

20 Cloning and Analysis of a 

PlPAC From Arabldovsls thallana 

The pi PAC of the present invention was 
identified by its up-regulation in response to a chloride 

25 ion channel blocker. Bxassica napus plants were grown 
either in the presence or absence of 20 pM 5-nitro-2- (3- 
phenylpropyl amino) benzoic acid (NPPB) . After five days, 
the roots of the seedlings were harvested and total RNA 
was extracted separately from the treated and untreated 

30 plants. From the total RNA preparations, poly (A) + RNA 
was isolated and used as the starting material to create 
a cDNA subtraction library, using the CLONTECH PCR- 
SELECT™ cDNA Subtraction Kit and accompanying 
instructions (CLONTECH Laboratories, Inc., Palo Alto, 

35 CA) . 
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Using the subtractive hybridization kit, a gene 
fragment was identified that was up-regulated in response 
to treatment of the plants with NPPB. This fragment (0.5 
kb) was used to screen a cDNA. library of Arabidopsis 
5 thaliana, from which a full-length cDNA clone was 

isolated- The nucleotide sequence of this cDNA clone, 
referred to as ATPAC {Arabidopsis thaliana putative anion . 
channel) is set forth below as SEQ ID NO:l. 

The 3.76 kb cDNA clone encodes a polypeptide 

10 1,254 amino acids in length. The deduced amino acid 

sequence encoded by SEQ ID NO:l is shown in Figure 1 as 
*atpac" (SEQ ID NO:2), in a lineup with the following 
sequences: (1) hmdrl (SEQ ID NO:3); (2) mmdrl (SEQ ID 
NO:4); (3) hmdr3 (SEQ ID NO:5); (4) mmdr2 (SEQ ID N0:6); 

15 (5) atpgpl (SEQ IDNO:7); and (6) atpgp2 (SEQ ID NO:8). 
A consensus sequence (SEQ ID NO: 9) is also shown. 

A search of various sequence databases 
indicates that ATPAC is a new and distinct member of the 
mdr family of ABC transporters. In none of the 

20 databases, including the EST collection, does an exact 
match exist. The ABC transporter family is very large, 
consisting of at least two sub-groups, mrp and homologs 
and mdr and homologs. The only examples of plant mdr- 
like genes are atpgpl and atpgp2 from A. thaliana and two 

25 homologs from potato and barley, respectively. Though 
the atpgp2 and atpgp2 genes are similar to ATPAC, they 
are only 51 and 50% identical, respectively, indicating 
that ATPAC is a distinct gene by comparison. Sequence 
homology with the potato and barley rodr-like genes is 

30 even more divergent. Another difference between the 
agpgpl gene and the ATPAC gene is their respective 
preferential expression in inflorescens and roots, 
respectively. 
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EXAMPLE 2 

Effect of ATPAC Expression in Bacterial Cells 
5 on Their Ability to Detoxify Rhodamine 6G 

The compound Rhodamine 6G is a well known 
substrate of mammalian p-glycoproteins (Kolaczkowski et 
al., J. Biol. Chem. 271: 31543-31548, 1996). The ability 

10 of a cell to detoxify the compound is indicative of 

activity of p-glycoproteins. A bacterial cell line was 
transformed with an expression vector comprising ATPAC. 
The growth rate of transformed and non-transformed cells 
was then measured/ in the presence or absence of 

15 Rhodamine 6G. Results are shown in Figure 2. As can be 
seen, ATPAC-expressing cells grown in the absence of the 
drug had the best growth rate. Moreover, even in the 
presence of the drug, the cells grew more quickly than 
non- trans formed cells in the presence or absence of 

20 Rhodamine 6G. These results demonstrate that ATPAC 
encodes a functional and robust p-glycoprotein. 

Example 3 
Transgenic Plants the Overexpress 
25 and Underexpress ATPAC 

Transformation construct. The AgroJbacterium 
binary vector pPZP211 (Hajdukiewicz et al., 1994 Plant 
Mol. Biol. 25:989-994) was digested with EcoRl and Smal, 

30 and self-ligated. This molecule was named pPZP211'. The 
AgroJbacterium binary vector pCGN7366 (Calgene, CA) was 
digested with Xhol and cloned in Sail-digested pPZP211' . 
We named this binary vector pPZP-PCGN. The 3.8 kb full- 
length ATPAC cDNA was cloned into the pGH19 vector. 

35 After digestion with Smal (in the multiple cloning site 
upstream) and £co«RI, a 3.1 kb cDNA fragment was cut out. 
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This Smal-EcoRI 3.1 kb fragment was cloned into the 
Smal/EcoRI site of pPZP-pCGN. The rest of ATPAC gene was 
amplified using polymerase chain reaction to have 
translationally fused HA-tag at its 3 '-terminal. After 
5 ligating EcoRl linkers to the ends of the resulting PCR 
product, the 0.7 kb fragment was cloned into the EcoRI 
site of the Smal-EcoRI 3.1 kb ATPAC fragment in pPZP- 
pCGN. The final construct was named pATPAC-OE. 

Plant transformation. pATPAOOE was introduced 
10 into Agrobacterivm tumefaciens strain by a direct 
transformation method. Agrobacteri urn-mediated 
transformation was performed using vacuum infiltration 
(Bechtold et al., 1993,. CR Acad. Sci. [Ill] 316: 
1194-1199.) 

15 Tl plants which survived on 

kanamycin-containing plates were selected, transplanted 
into soil and grown to set T2 seed. T3 seeds were 
collected from kanamycin-resistant T2 plants. T3 plants 
which showed 100% kanamycin-resistance were selected and 

20 were considered homozygous for the transgene. 

Antisense Plants. The full length cDNA in 
pBluescript SK(-) vector (Stratagene, CA) is digested 
with EcoRI (there is a cleavage site in the upstream 

25 polylinker) and Sspl. The resulting 1.3 Kb fragment 

representing a 5' portion of the AtPAC cDNA was cloned 
into the aforementioned pPZP-PCGN, which had been 
digested with EcoRI/ Smal, ensuring that this fragment of 
the cDNA was inserted in the antisense orientation. This 

30 construct was named pATPAC-AE. pATPAC-AE was introduced 
into Arabidopsis plants by Agrobacterium transformation, 
as described above. 
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Knock-out Plants. The method of Krysan et al 
(1996, PNAS 93:8145, incorporated by reference herein) 
was followed using the following primers: 
Gene-specific primers: 
5 AtpacF: CACTGCTCAATGATCTCGTTTTCTCACTA (SEQ ID NO: 11) 
AtpacR: CTTGAATCACACCAATGCAATCAACACCTC (SEQ ID NO: 12) 
Primers for T-DNA left boarder: 

JL202: CATTTTATAATAACGCTGCGGAGATCTAC (SEQ ID NO: 13) 
JL270: TTTCTCCATATTGACCATCATACTCATtG (SEQ ID NO: 14) 

10 

While certain of the preferred embodiments of 
the present invention have been described and 
specifically exemplified above, it is not intended that 
the invention be limited to such embodiments. Various 
15 modifications may be made thereto without departing from 
the scope and spirit of the present invention, as set 
forth in the following claims. 
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What is claimed: 

1. A nucleic acid isolated from a plant, which 
encodes a p-glycoprotein that is inducible by exposure of 
the plant to NPPB. 

2. The isolated nucleic acid of claim 1, which . 
is preferentially expressed in plant roots upon exposure 
of the plant to NPPB. 

3. The isolated nucleic acid of claim 1, 
wherein the plant is selected from the group consisting 
of Brassica napus and Arabidopsis thaliana and is 3850- 
4150 nucleotides long. 

4. The isolated nucleic acid of claim 1, which 
has the restriction sites shown in Figure 4 for at least 
three enzymes. 

20 5. The isolated nucleic acid of claim 4, which 

encodes a polypeptide having SEQ ID NO: 2. 

6, The isolated nucleic acid of claim 5, which 
is a cDNA comprising a coding region selected from the 

25 group consisting of SEQ ID N0:1 and SEQ ID NO: 10. 

7. An isolated protein, which is a product of 
expression of part or all of the isolated nucleic acid 
molecule of claim 1. 



15 



30 



8. Antibodies immunologically specific for the 
protein of claim 7. 

9. A expression cassette, which comprises a 
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plPAC gene coding sequence operably linked to a promoter. 



10. The expression cassette of claim 9, which 
comprises a pi P AC gene from Arabidopsis thaliana. 

5 

11. The expression cassette' of claim 10, in 
which the promoter is the cauliflower mosaic virus 35S 
promoter. 

10 12. The expression cassette of claim 10, in 

which the plPAC gene is part or all of SEQ ID N0:1 or SEQ 
ID NO: 10. 

13. A vector comprising the expression 
15 cassette of claim 9. 

14. The vector of claim 13, which is comprised 
of an Agrobacterium binary vector selected from the group 
consisting of pPZP211 and pCGN7366. 

20 

15. A method for producing a plant with 
enhanced resistance to xenobiotic compounds by 
transforming in vitro the plant with the expression 
cassette of claim 9. 

25 

16. The method of claim 15, wherein the 
transformation step further uses the vector of claim 13. 

17. A transgenic plant produced by the method 
30 of claim 15. 

18. A reproductive unit form the transgenic 
plant of claim 17. 
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■■19- A cell from the transgenic plant of claim 

17. 

20. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 1, operably linked to a 
vector for transforming cells. 

21. A cell transformed with the recombinant 
DNA molecule of claim 20. 

22. The cell of claim 21, selected from the 
group consisting of bacterial cells, yeast cells and 
plant cells. 



15 23. A transgenic plant regenerated from the 

transformed cell of claim 22. 

24. An isolated nucleic acid molecule of at 
least 20 nucleotides in length having a sequence selected 
20 from the group consisting of: 

a) SEQ ID N0:1 and SEQ ID NO: 10; 

b) a nucleic acid sequence that is at least 
about 60% homologous to the coding regions of SEQ ID N0:1 
or SEQ ID NO: 10; 

25 c) a sequence hybridizing with SEQ ID NO:l or 

SEQ ID NO: 10 at moderate stringency; 

d) a sequence encoding part or all of a 
polypeptide having SEQ ID NO: 2; 

e) a sequence encoding an amino acid sequence 
30 that is at least about 70% identical to SEQ ID NO: 2; 

f ) a sequence encoding an amino acid sequence 
that is at least about 80% similar to SEQ ID NO: 2; 

g) a sequence encoding an amino acid sequence 
that is at least about 40% similar to residues 1-76, 613- 
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669 or 1144-1161 of SEQ ID NO: 2; and 

h) a sequence hybridizing at moderate 
stringency to a sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID N0:2. 

5 

25. A polypeptide produced by expression of the 
nucleic acid sequence of claim 24. 

26. Antibodies immunologically specific for the 
10 polypeptide of claim 24. 

27. An oligonucleotide between about 10 and 
about 100 nucleotides in length, which specifically 
hybridizes at moderate stringency with a portion of the 

15 nucleic acid molecule of claim 24. 

28. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 24, operably linked to a 
vector for transforming cells. 

20 

29. A cell transformed with the recombinant DNA 
molecule of claim 28. 

30. The cell of claim 29, selected from the 

25 group consisting of bacterial cells, yeast cells and plant 
cells. 

31. A transgenic plant regenerated from the 
cell of claim 30. 



30 



32. An isolated plant p-glycoprotein, which is 
inducible upon exposure of the plant to NPPB. 

33. The p-glycoprotein of claim 32, which 
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confers upon a cell in which it is found resistance to 
Rhodamine 6G. 

34. The p-glycoprotein of claim 33/ which is 

5 preferentially produced in roots upon the exposure to the 
NPPB. 

35. The p-glycoprotein of claim 34, from a 
plant selected from the group consisting of Brassica napus 

10 and AraJbidopsis thaliana. 

36. The p-glycoprotein of claim 35, having an 
amino acid sequence that selected from the group 
consisting of: 

15 a) an amino acid sequence that is at least 80% 

similar to SEQ ID N0:2; 

b) an amino acid sequence that is at least 70% 
identical to SEQ ID NO: 2; 

c) an amino acid sequence that is at least 40% 
20 similar to residues 1-76, 613-669 or 1144-1161 of SEQ ID 

NO:2; and 

d) an amino acid sequence encoded by a nucleic 
acid sequence hybridizing at moderate stringency to a 
amino acid sequence encoding residues 1-76, 613-669 or 

25 1144-1161 of SEQ ID N0:2. 

37. Antibodies immunologically specific for the 
p-glycoprotein of claim 32. 

30 38 The antibodies of claim 35, that are 

immunologically specific to residues 1-76, 613-669 or 
1144-1161 of SEQ ID NO:2. 

39. A plant p-glycoprotein gene promoter which 
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is inducible by NPPB. 

40. The plant p-glycoprotein gene promoter of 
claim 39, that is part or all of residues 1-3429 of SEQ ID 

5 N0:10. 

41. A plant with reduced levels of plPAC 

. protein. 

10 42. The plant of claim 41, wherein the native 

plPAC gene is mutated. 

: 43. The plant of claim 42, wherein the plPAC 
gene is mutated due to the insertion of a T-DNA. 

15 

44. A method for making the plant of claim 42, 
wherein a population of mutated plants are screened using 
at least one of SEQ ID NOS: 11-14 as PCR primers. 

20 45. The method of claim 44, wherein the 

population of plants is mutated by T-DNA insertion. 
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1. 



SEQUENCE LISTING 

<110> Wisconsin Alumni Research Foundation 
Spalding, Edgar P. 
Noh, Bosl 

<120> Xenobiotic Detoxification Gene from 
Plants* 

<130> WARF S212 

<150> 60/101,814 
<151> 1998-09-25 

<160> 14 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 4051 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature 

<222> (94).;. |0) 

<223> Translation start codon 

<221> misc_feature 
<222> (3932) ... (0) 
<223> Stop codon 

<400> 1 

cttgaacttc acaaaacaat tgtcagattt tcaagaaaaa ctttataaaa caaaaaacat 60 

ttcattcttt ctctctctct ctctcactgc tcaatgatct cgttttctca ctaaaccaac 120 

tcgtttcttc ttactttctt taactcggat ctacaaaaaa ccatgtcgga aactaacaca 160 

accgatgcca agactgttcc agcagaagca gagaagaaga aagaacagag tttaccattc 240 

tttaaactct tttctttcgc tgataaattt gattatctct taatgttcgt tggttctctt 300 

ggtgccattg ttcatggctc ttccatgcct gtcttctttt tactctttgg tcaaatggtt 360 

aatggatttg gtaaaaacca aatggattta catcaaatgg ttcatgaagt ctctagatat 420 

tctctatatt tcgtctactt gggtttggtc gtttgcttct cttcttacgc agagatagca 480 

tgttggatgt attctggaga aagacaagta gcagcattaa ggaagaaata tcttgaagca 540 

gtattaaaac aagacgttgg gttctttgat actgatgcta gaactggtga cattgtcttt 600 

agtgtttcta ctgatactct tcttgttcaa gatgccatta gtgaaaaggt tggaaacttt 660 

atacattacc tctcaacatt tttggcggga ttagtagttg gatttgtatc agcatggaaa 720 

ttagctttgt taagtgttgc tgtgattccc ggaatcgctt tcgccggagg tttatacgct 780 

tatacactca ccggaattac ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc 840 

gccgagcagg caattgctca agttcgaact gtttattctt atgttggaga gagtaaggca 900 

cttaatgcgt attcggatgc gattcagtat acgcttaagc tcggttataa agcggggatg 960 

gctaaagggt tgggtttagg atgtacttat ggaatagctt gtatgtcatg ggctttggtg 1020 

ttttggtatg ctggagtttt tattcggaat ggacaaaccg atggaggaaa ggcgtttact 1080 

gctatattct ctgctattgt tggtggaatg agtttggggc aatctttctc gaatcttggg 1140 

gcgtttagta aaggtaaagc ggctggttat aagttgatgg agataattaa ccagagaccg 1200 

acgataattc aagacccgtt ggatggaaaa tgtttggatc aagttcatgg gaacattgag 1260 

tttaaagatg tgacttttag ctatccttca cggcctgatg ttatgatctt caggaacttt 1320 

aatattttct tcccttctgg gaaaactgtg gcggttgttg gtgggagtgg ctctggaaag 1380 

agtactgttg tttccctcat tgagagattc tatgatccaa acagcgggca aattctgttg 1440 

gatggtgttg agataaagac gcttcagttg aagtttttgc gtgaacaaat cgggcttgtg 1500 

aatcaagaac ctgcgfctctt tgccactact atactagaga acatactcta tggaaagcct 1560 

gatgcaacaa tggttgaagt tgaagctgct gcttccgctg cgaatgcgca tagtttcatt 1620 

acattacttc ctaaaggcta cgacacacag gttggagaac gtggtgttca actctcaggt 1680 
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2. — 

ggacagaagc agagaattgc aattgctagg gcgatgttga aagacccaaa gattctgtta 1740 

ctagatgaag ctacaagcgc tcttgatgct agctctgaga gcattgttca ggaagcttta 1800 

gacagagtca tggtggggag gaccactgtt gttgttgctc atcgtctctg caccatcaga 1860 

aatgttgatt ccattgccgt gatacagcaa ggccaagttg ttgaaaccgg aacacatgaa 1920 

gaactcattg ccaaatccgg tgcttacgca tccctcatca ggtttcagga aatggttggt 1980 

actcgagatt tctcaaaccc gtcaactcgt cgcactcgtt caacccgttt gagccattca 2040 

ctgtcaacga aatcactcag tttaagatca ggaagtttga ggaatctgag ctattcttac 2100 

agcactggag ctgatggtcg gatagagatg atttcaaatg cagagactga ccgaaagact 2160 

cgtgcccctg aaaattactt ctacaggctt ctcaagctta attcaccgga atggccttac 2220 

tcaatcatgg gagcagtagg ctcaattctt tctggtttca ttggtcctac atttgctatt 2280 

gtgatgagca acatgatcga agtcttctac tacacagact atgattcaat ggaaaggaaa 2340 

acaaaagagt atgtcttcat ctacattggt gctggtctct atgctgtggg tgcttatttg 2400 - 

atccaacatt acttctttag catcatggga gaaaacctca caacaagagt aagaagaatg 2460 

atgctctcag ctatcttgag aaacgaagtt ggttggttcg atgaggatga acacaactca 2520 

agcctgatcg ctgcacgttt agctactgat gcagcagatg ttaaatccgc tatagccgag 2580 

agaatctcag taattctaca aaacatgact tcacttctca catccttcat agtcgccttc 2640 

atagtagaat ggagagtctc acttctcatc ttaggcacat tcccacttct agtcctcgct 2700 

aactttgctc agcaactatc tctgaagggt tttgctggag acacagctaa ggctcatgca 2760 

aagacttcaa tgattgctgg tgaaggagtc agtaacatta gaaccgtagc agctttcaat 2820 

gcacagagca agattctctc tttgttctgt catgagcttc gtgtacctca gaaaagaagc 2880 

ttaagcttat accgaagtca aacctcgggt ttcctatttg gcctctcgca gcttgctctc 2940 

tatggttctg aggctttaat tctctggtat ggtgcccacc ttgtgagtaa aggcgtgtca 3000 

accttttcca aagtgatcaa agtgtttgtg gttttggtca ttactgcaaa ctctgttgct 3060 

gaaactgtca gtcttgctcc tgaaattatt cggggaggtg aagctgttgg ttcggttttc 3120 

tcggtcttgg acaggcagac caggattgac ccggatgatg ctgatgctga tcccgtggag 3180 

acgatccgtg gagacattga gtttaggcat gttgatttcg cttacccttc aagacccgac 3240 

gtcatggttt tcagggactt taacctcaga attcgagctg gacatagcca agctcttgtg 3300 

ggcgcgagtg ggtcagggaa gagttctgta attgcgatga tcgagcggtt ttacgacctt 3360 

cttgctggaa aagtcatgat tgatggcaaa gacatccgcc ggctaaacct gaaatctcta 3420 

aggctcaaaa tcggtcttgt tcaacaagaa ccagctcttt tcgcagcaac gatcttcgac 3480 

aacatcgcct atggtaaaga tggtgcaact gaatccgagg taattgatgc agctcgagcc 3540 

gcaaatgctc acggtttcat cagtggttta cctgaaggtt acaaaactcc agtaggcgaa 3600 

agaggagtgc agttatcagg tggacagaaa cagaggatcg cgatagcaag agctgtgctc 3660 

aagaacccta cagtgttgct tctagacgaa gcaactagcg cactagatgc agaatcagaa 3720 

tgcgtgctgc aagaggcgtt agagaggctc atgagaggtc ggaccaccgt ggtagttgct 3780 

caccgcttgt ccaccataag aggtgttgat tgcattggtg tgattcaaga cgggcggatt 3840 

gtggagcaag gcagccattc agagctcgtt agccgaccag agggagctta ttcaaggctg 3900 

ttacagcttc aaacacatag gatttgaagc ttgatcatgg attaaaaaca aaaaatcggt 3960 

ttgtgtaatt ttttttatat taaaacttta atttggaaga tttctatgga ctataacgat 4020 

aatatgaata ggtgtagata atgaagcttt t 4051 

<210> 2 
<211> 1254 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ser Glu Thr Asn Thr Thr Asp Ala Lys Thr Val Pro Ala Glu Ala 

1 5 10 15 

Glu Lys Lys Lys Glu Gin Ser Leu Pro Phe Phe Lys Leu Phe Ser Phe 

20 25 30 

Ala Asp Lys Phe Asp Tyr Leu Leu Met Phe Val Gly Ser Leu Gly Ala 

35 40 45 

lie Val His Gly Ser Ser Met Pro Val Phe Phe Leu Leu Phe Gly Gin 

50 55 60 

Met Val Asn Gly Phe Gly Lys Asn Gin Met Asp Leu His Gin Met Val 
65 70 75 80 

His Glu Val Ser Arg Tyr Ser Leu Tyr Phe Val Tyr Leu Gly Leu Val 

85 90 95 

Val Cys Phe Ser Ser Tyr Ala Glu lie Ala. Cys Trp Met Tyr Ser Gly 

100 105 ' 110 

Glu Arg Gin Val Ala Ala Leu Arg Lys Lys Tyr Leu Glu Ala Val Leu 
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Lvs Gin Ast> 


Val 


130 




Val Phe Ser 


Val 


145 




Glu Lvs Val 


Glv 
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Leu Val Val 


Gly 




180 


Ala Val Tie 

Aia v a i a a c 


Pro 


195 




lie IX XXXI wXjr 


lie 


210 




Val Tie Ala 
vax lie /u.a 


Glu 






Val Gl v Glu 

VOX Ui y V31U 


Ser 


Thr Leu Lys 


Leu 




C \j\J 


wiy uys x i ix. 


iyr 


275 




xyr ax a wiy 


Val 






rne rnr A-ia 


Ti- 
ne 






oci rne jcx 


Asn 


Lys Leu Met 


Glu 




340 


xicu x\o vxy 


xiyo 


j j j 




Asn Val Thr 
x^^xj vai xxxi 


Phe 


370 




low PVio 2V ctj 
rX&ll XTlie AVOll 


Tie 
lie 


385 




Gl \/ Gl v 
vji jr Jci »xy 


Ser 


Tyr Asp Pro 


Asn 




420 


Thr T,«»ii Gin 

1 111 itCU 


Leu 


435 




Glu Pro Ala 

vl U Clw iLl C* 


Leu 


450 




Lvs Pro Asn 

AIT w 11 V ^v*9£s 


Ala 


465 




Asn Ala Hi •? 

4uJ 11 AXC1 111 ^> 


Ser 


Val Gly Glu 


Arg 




500 


Ala Tie Ala 
ax« lie /Via 


A TTT 

niy 


515 




Glu Ala Thr 
viu -*vi a x xn 


►J CI 


530 




Ala Leu Asp 


Arg 


545 




Arg Leu Cys 


Thr 


Gly Gin Val 


Val 




580 


Gly Ala Tyr 


Ala 


595 




Asp Phe Ser 


Asn 



120 

Gly Phe Phe Asp Thr Asp Ala 
135 

Asp Thr Leu 



Ser Thr 
150 
Asn Phe 
165 

Phe Val 

Gly lie 

Thr Ser 

Gin Ala 
230 
Lys Ala 
245 

Gly Tyr 

Gly He 

Phe lie 

Phe Ser 
310 
Leu Gly 
325 

He lie 

Cys Leu 

Ser Tyr 

Phe Phe 
390 
Gly Lys 
405 

Ser Gly 

Lys Phe 

Phe Ala 

Thr Met 
470 
Phe He 
485 

Gly Val 

Ala Met 

Ala Leu 

Val Met 
550 
He Arg 
565 

Glu Thr 
Ser Leu 
Pro Ser 



He His Tyr 

Ser Ala Trp 
185 

Ala Phe Ala 

200 
Lys Ser Arg 
215 

He Ala Gin 

Leu Asn Ala 

Lys Ala Gly 
265 

Ala Cys Met 

280 
Arg Asn Gly 
295 

Ala lie Val 

Ala Phe Ser 

Asn Gin Arg 
345 

Asp Gin Val 

360 
Pro Ser Arg 
375 

Pro Ser Gly 

Ser Thr Val 

Gin He Leu 
425 

Leu Arg Glu 

440 
Thr Thr He 
455 

Val Glu Val 

Thr Leu Leu 

Gin Leu Ser 
505 

Leu Lys Asp 

520 
Asp Ala Ser 
535 

Val Gly Arg 

Asn Val Asp 

Gly Thr His 
585 

He Arg Phe 

600 
Thr Arg Arg 



Leu Val 
155 
Leu Ser 
170 

Lys Leu 

Gly Gly 

Glu Ser 

Val Arg 
235 
Tyr Ser 
250 

Met Ala 

Ser Trp 

Gin Thr 

Gly Gly 
315 
Lys Gly 
330 

Pro Thr 

His Gly 

Pro Asp 

Lys Thr 
395 
Val Ser 
410 

Leu Asp 

Gin He 

Leu Glu 

Glu Ala 
475 
Pro Lys 
490 

Gly Gly 

Pro Lys 

Ser Glu 

Thr Thr 
555 
Ser He 
570 

Glu Glu 
Gin Glu 
Thr Arg 



125 
Arg Thr 
140 

Gin Asp 

Thr Phe 

Ala Leu 

Leu Tyr 
205 
Tyr Ala 
220 

Thr Val 

Asp Ala 

Lys Gly 

Ala Leu 
285 
Asp Gly 
300 

Met Ser 

Lys Ala 

He He 

Asn He 
365 
Val Met 
380 

Val Ala 

Leu He 

Gly Val 

Gly Leu 
445 
Asn He 
460 

Ala Ala 

Gly Tyr 

Gin Lys 

He Leu 
525 
Ser He 
540 

Val Val 

Ala Val 

Leu He 

Met Val 
605 
Ser Thr 



Gly Asp He 

Ala He Ser 
160 

Leu Ala Gly 

175 
Leu Ser Val 
190 

Ala Tyr Thr 

Asn Ala Gly 

Tyr Ser Tyr 
240 

He Gin Tyr 

255 
Leu Gly Leu 
270 

Val Phe Trp 

Gly Lys Ala 

Leu Gly Gin 
320 

Ala Gly Tyr 

335 
Gin Asp Pro 
350 

Glu Phe Lys 

He Phe Arg 

Val Val Gly 
400 

Glu Arg Phe 

415 
Glu lie Lys 
430 

Val Asn Gin 

Leu Tyr Gly 

Ser Ala Ala 
480 

Asp Thr Gin 

495 
Gin Arg lie 
510 

Leu Leu Asp. 

Val Gin Glu 

Val Ala His 
560 

He Gin Gin 

575 
Ala Lys Ser 
590 

Gly Thr Arg 
Arg Leu Ser 
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610 615 620 

His Ser Leu Ser Thr Lys Ser Leu Ser Leu Arg Ser Gly Ser Leu Arg 
625 630 635 640 

Asn Leu Ser Tyr Ser Tyr Ser Thr Gly Ala Asp Gly Arg lie Glu Met 

645 650 655 

lie Ser Asn Ala Glu Thr Asp Arg Lys Thr Arg Ala Pro Glu Asn Tyr 

660 665 670 

Phe Tyr Arg Leu; Leu Lys Leu Asn Ser Pro Glu Trp Pro Tyr Ser lie 

675 680 685 

Met Gly Ala Val Gly Ser lie Leu Ser Gly Phe He Gly Pro Thr Phe 

690 695 700 

Ala lie Val Met Ser Asn Met He Glu Val Phe Tyr Tyr Thr Asp Tyr 
705 710 715 720 

Asp Ser Met Glu Arg Lys Thr Lys Glu Tyr Val Phe lie Tyr He Gly 

725 730 735 

Ala Gly Leu Tyr Ala Val Gly Ala Tyr Leu He Gin His Tyr Phe Phe 

740 745 750 

Ser He Met Gly Glu Asn Leu Thr Thr Arg Val Arg Arg Met Met Leu 

755 760 765 

Ser Ala He Leu Arg Asn Glu Val Gly Trp Phe Asp Glu Asp Glu His 

770 775 780 

Asn Ser Ser Leu He Ala Ala Arg Leu Ala Thr Asp Ala Ala Asp Val 
785 790 795 800 

Lys Ser Ala He Ala Glu Arg He Ser Val He Leu Gin Asn Met Thr 

805 810 815 

Ser Leu Leu Thr Ser Phe He Val Ala Phe He Val Glu Trp Arg Val 

820 825 830 

Ser Leu Leu He Leu Gly Thr Phe Pro Leu Leu Val Leu Ala Asn Phe 

835 840 845 

Ala Gin Gin Leu Ser Leu Lys Gly Phe Ala Gly Asp Thr Ala Lys Ala 

850 855 860 

His Ala Lys Thr Ser Met He Ala Gly Glu Gly Val Ser Asn He Arg 
865 870 875 880 

Thr Val Ala Ala Phe Asn Ala Gin Ser Lys He Leu Ser Leu Phe Cys 

885 890 895 

His Glu Leu Arg Val Pro Gin Lys Arg Ser Leu Ser Leu Tyr Arg Ser 

900 905 910 

Gin Thr Ser Gly Phe Leu Phe Gly Leu Ser Gin Leu Ala Leu Tyr Gly 

915 * 920 925 

Ser Glu Ala Leu He Leu Trp Tyr Gly Ala His Leu Val Ser Lys Gly 

930 935 940 

Val Ser Thr Phe Ser Lys Val He Lys Val Phe Val Val Leu Val He 
945 950 955 960 

Thr Ala Asn Ser Val Ala Glu Thr Val Ser Leu Ala Pro Glu He He 

965 970 975 

Arg Gly Gly Glu Ala Val Gly Ser Val Phe Ser Val Leu Asp Arg Gin 

980 985 990 

Thr Arg lie Asp Pro Asp Asp Ala Asp Ala Asp Pro Val Glu Thr He 

995 1000 1005 

Arg Gly Asp He Glu Phe Arg His Val Asp Phe Ala Tyr Pro Ser Arg 

1010 1015 1020 

Pro Asp Val Met Val Phe Arg Asp Phe Asn Leu Arg He Arg Ala Gly 
1025 1030 1035 1040 

His Ser Gin Ala Leu Val Gly Ala Ser Gly Ser Gly Lys Ser Ser Val 

1045 1050 1055 

He Ala Met He Glu Arg Phe Tyr Asp Leu Leu Ala Gly Lys Val Met 

1060 1065 1070 

He Asp Gly Lys Asp lie Arg Arg Leu Asn Leu Lys Ser Leu Arg Leu 

1075 : 1080 1085 

Lys He Gly Leu Val Gin Gin Glu Pro Ala Leu Phe Ala Ala Thr He 

1090 1095 1100 

Phe Asp Asn He Ala Tyr Gly Lys Asp Gly Ala Thr Glu Ser Glu Val 
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1105 1110 1115 1120 

lie Asp Ala Ala Arg Ala Ala Asn Ala His Gly Phe lie Ser Gly Leu 

1125 1130 1135 

Pro Glu Gly Tyr Lys Thr Pro Val Gly Glu Arg Gly Val Gin Leu Ser 

1140 1145 1150 

Gly Gly Gin Lys Gin Arg lie Ala lie Ala Arg Ala Val Leu Lys Asn 

1155 " 1160 1165 

Pro Thr Val Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu 

1170 1175 1180 

Ser Glu Cys Val Leu Gin Glu Ala Leu Glu Arg Leu Met Arg Gly Arg 
1185 1190 1195 1200 

Thr Thr Val Val Val Ala His Arg Leu Ser Thr He Arg Gly Val Asp 

1205 1210 1215 

Cys He Gly Val He Gin Asp Gly Arg He Val Glu Gin Gly Ser His 

1220 1225 1230 

Ser Glu Leu Val Ser Arg Pro Glu Gly Ala Tyr Ser Arg Leu Leu Gin 

1235 1240 1245 

Leu Gin Thr His Arg He 
1250 

<210> 3 

<211> 1280 

<212> PRT 

<213> Homo sapiens 

<300> 

<308> Genbank P08183 
<309> 1997-11-01 

<400> 3 



Met 
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Leu Glu 


Gly 


Asp 


Arg 


Asn 


Gly 


Gly Ala Lys Lys Lys 


Asn Phe 
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Asp Lys Lys Glu Lys 
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Thr 
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Ser Val 
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Phe 
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Tyr Ser Asn Trp Leu 
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45 




Leu 


Tyr 


Met Val 


Val 


Gly 


Thr 


Leu 


Ala 


Ala lie He His Gly 


Ala Gly 




50 






55 






60 




Leu 


Pro 


Leu Met 


Met 


Leu 


Val 


Phe 


Gly 


Glu Met Thr Asp He 


Phe Ala 


65 








70 






75 


80 


Asn 


Ala 


Gly Asn 


Leu 


Glu 


Asp 


Leu 


Met 


Ser Asn lie Thr Asn 


Arg Ser 








85 










90 


95 


Asp 


He 


Asn Asp 


Thr 


Gly 


Phe 


Phe 


Met 


Asn Leu Glu Glu Asp 


Met Thr 






100 










105 


110 




Arg 


Tyr 


Ala Tyr 


Tyr 


Tyr 


Ser 


Gly 


lie 


Gly Ala Gly Val Leu 


Val Ala 






115 








120 




125 




Ala 


Tyr 


He Gin 


Val 


Ser 


Phe 


Trp 


Cys 


Leu Ala Ala Gly Arg 


Gin lie 




130 








135 






140 




His 


Lys 


lie Arg 


Lys 


Gin 


Phe 


Phe 


His 


Ala He Met Arg Gin 


Glu lie 


145 


150 








155 


160 


Gly 


Trp 


Phe Asp 


Val 


His 


Asp 


Val 


Gly 


Glu Leu Asn Thr Arg 


Leu Thr 








165 










170 


175 


Asp 


Asp 


Val Ser 


Lys 


He 


Asn 


Glu 


Val 


He Gly Asp Lys He 


Gly Met 






180 










185 


190 




Phe 


Phe 


Gin Ser 


Met 


Ala 


Thr 


Phe 


Phe 


Thr Gly Phe He Val 


Gly Phe 






195 








200 




205 




Thr 


Arg 


Gly Trp 


Lys 


Leu 


Thr 


Leu 


Val 


He Leu Ala He Ser 


Pro Val 




210 








215 






220 




Leu 


Gly 


Leu Ser 


Ala 


Ala 


Val 


Trp 


Ala 


Lys He Leu Ser Ser 


Phe Thr 


225 






230 








235 


240 


Asp 


Lys 


Glu Leu 


Leu 


Ala 


Tyr 


Ala 


Lys 


Ala Gly Ala Val Ala 


Glu Glu 






245 










250 


255 
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Val Leu Ala Ala lie Arg Thr Val lie Ala Phe Gly Gly Gin Lys Lys 

260 265 270 

Glu Leu Glu Arg Tyr Asn Lys Asn Leu Glu Glu Ala Lys Arg lie Gly 

275 280 . 285 

He Lys Lys Ala He Thr Ala Asn He Ser He Gly Ala Ala Phe Leu 

290 295 . 300 

Leu He Tyr Ala Ser Tyr Ala Leu Ala Phe Trp Tyr Gly Thr Thr Leu 
305 " 310 315 320 

Val Leu Ser Gly Glu Tyr Ser He Gly Gin Val Leu Thr Val Phe Phe 

325 330 335 

Ser Val Leu He Gly Ala Phe Ser Val Gly Gin Ala Ser Pro Ser He 

340 345 350 

Glu Ala Phe Ala Asn Ala Arg Gly Ala Ala Tyr Glu He Phe Lys He 

355 360 365 

He Asp Asn Lys Pro Ser He Asp Ser Tyr Ser Lys Ser Gly His Lys 

370 375 380 

Pro Asp Asn He Lys Gly Asn Leu Glu Phe Arg Asn Val His Phe Ser 
385 390 395 400 

Tyr Pro Ser Arg Lys Glu Val Lys He Leu Lys Gly Leu Asn Leu Lys 

405 410 415 

Val Gin Ser Gly Gin Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly 

420 425 430 

Lys Ser Thr Thr Val Gin Leu Met Gin Arg Leu Tyr Asp Pro Thr Glu 

435 440 445 

Gly Met Val Ser Val Asp Gly Gin Asp He Arg Thr He Asn Val Arg 

450 455 460 

Phe Leu Arg Glu He He Gly Val Val Ser Gin Glu Pro Val Leu Phe 
465 470 475 480 

Ala Thr Thr He Ala Glu Asn He Arg Tyr Gly Arg Glu Asn Val Thr 

485 490 495 

Met Asp Glu He Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe 

500 505 510 

He Met Lys Leu Pro His Lys Phe Asp Thr Leu Val Gly Glu Arg Gly 

515 520 525 

Ala Gin Leu Ser Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala 

530 "" 535 540 

Leu Val Arg Asn Pro Lys He Leu Leu Leu Asp Glu Ala Thr Ser Ala 
545 550 555 560 

Leu Asp Thr Glu Ser Glu Ala Val Val Gin Val Ala Leu Asp Lys Ala 

565 570 575 

Arg Lys Gly Arg Thr Thr He Val He Ala His Arg Leu Ser Thr Val 

580 585 590 

Arg Asn Ala Asp Val lie Ala Gly Phe Asp Asp Gly Val He Val Glu 

595 600 605 

Lys Gly Asn His Asp Glu Leu Met Lys Glu Lys Gly He Tyr Phe Lys 

610 615 620 

Leu Val Thr Met Gin Thr Ala Gly Asn Glu Val Glu Leu Glu Asn Ala 
625 630 635 640 

Ala Asp Glu Ser Lys Ser Glu lie Asp Ala Leu Glu Met Ser Ser Asn 

645 650 655 

Asp Ser Arg Ser Ser Leu He Arg Lys Arg Ser Thr Arg Arg Ser Val 

660 665 670 

Arg Gly Ser Gin Ala Gin Asp Arg Lys Leu Ser Thr Lys Glu Ala Leu 

675 680 685 

Asp Glu Ser He Pro Pro Val Ser Phe Trp Arg lie Met Lys Leu Asn 

690 695 700 

Leu Thr Glu Trp Pro Tyr Phe Val Val Gly Val Phe Cys Ala He He 
705 * 710 715 720 

Asn Gly Gly Leu Gin Pro Ala Phe Ala He He Phe Ser Lys He He 

725 730 735 

Gly Val Phe Thr Arg He Asp Asp Pro Glu Thr Lys Arg Gin Asn Ser 
740 745 750 
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He 
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Val 


He Thr 


Gin Asn He Ala 


Asn 
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Thr 


Gly 


He 


He 






835 




840 








845 
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Phe 


He Tyr 


Gly Trp Gin Leu 


Thr 


Leu 


Leu 


Leu 


Leu 


Ala 


He 




850 




855 






860 










Val 


Pro 


He 


lie Ala 


He Ala Gly Val 


Val 


Glu 


Met 


Lys 


Met 


Leu 


Ser 


865 








870 




875 










880 


Gly 


Gin 


Ala 


Leu Lys 


Asp Lys Lys Glu 


Leu 


Glu 


Gly 


Ala 


Gly 


Lys 


lie 






885 




890 










895 




Ala 


Thr 


Glu 


Ala lie 


Glu Asn Phe Arg 


Thr 


Val 


Val 


Ser 


Leu 


Thr 


Gin 








900 


905 










910 






Glu 


Gin 


Lys 


Phe Glu 


His Met Tyr Ala 


Gin 


Ser 


Leu 


Gin 


Val 


Pro 


Tyr 






915 




920 








925 








Arg 


Asn 


Ser 


Leu Arg 


Lys Ala His He 


Phe 


Gly 


He 


Thr 


Phe 


Ser 


Phe 


930 






935 
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Thr 
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Ala 


Met Met 


Tyr Phe Ser Tyr 
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Phe 


Arg 


Phe 


Gly 
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Ser 


Phe 


Glu 
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Val 
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Leu 






965 
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Ser 
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Val Phe Gly Ala 
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Ala 


Val 


Gly 
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Val 


Ser 








980 
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Ser 


Phe 
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Pro Asp 


Tyr Ala Lys Ala 
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He 


Ser 


Ala 


Ala 
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Met 
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He Glu 
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He 
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Ser 
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1010 
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Gly 


Leu 


Met 


Pro Asn 


Thr Leu Glu Gly 


Asn 


Val 


Thr Phe Gly 


Glu 


Val 


1025 






1030 




1035 








1040 


Val 


Phe Asn 


Tyr Pro 


Thr Arg Pro Asp 


He 


Pro 


Val 


Leu 


Gin 


Gly 


Leu 








104! 


1050 








1055 


Ser 


Leu 


Glu 


Val Lys Lys Gly Gin Thr 


Leu Ala Leu Val Gly 


Ser 


Ser 



1060 1065 1070 



Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe Tyr Asp 

1075 1080 1085 

Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys Glu He Lys Arg Leu 

1090 * ~ 1095 1100 

Asn Val Gin Trp Leu Arg Ala His Leu Gly He Val Ser Gin Glu Pro 
1105 * 1110 1115 1120 

He Leu Phe Asp Cys Ser He Ala Glu Asn lie. Ala Tyr Gly Asp Asn 

1125 1130 1135 

Ser Arg Val Val Ser Gin Glu Glu He Val Arg Ala Ala Lys Glu Ala 

1140 1145 1150 

Asn He His Ala Phe He Glu Ser Leu Pro Asn Lys Tyr Ser Thr Lys 

1155 1160 1165 

Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin Arg He 

1170 1175 1180 

Ala lie Ala Arg Ala Leu Val Arg Gin Pro His He Leu Leu Leu Asp 
1185 1190 1195 1200 

Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gin Glu 

1205 1210 1215 

Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He Ala His 

1220 ~ 1225 1230 

Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val Phe Gin Asn 
1235 1240 1245 



BNSOOCIO: <WO. 



_G018886MJ_> 



WO 00/18886 



PCT/US99/22363 



8 — 

Gly Arg Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala Gin Lys 

1250 1255 1260 

Gly lie Tyr Phe Ser Met Val Ser Val Gin Ala Gly Thr Lys Arg Gin 
1265 1270 1275 1280 

<210> 4 
<211> 1175 
<212> PRT 

<213> Mus musculus 
<300> 

<308> Genbank P06795 
<309> 1998-07-15 



<400> 4 



Ser 


Asn 


Ser 


Ser 


Leu 


Glu 


Glu 


Glu Met Ala He 


Tyr 


Ala Tyr Tyr Tyr 


1 








5 






10 




15 


Thr 


Gly 


He 


Gly 


Ala 


Gly 


Val 


Leu He Val Ala 


Tyr 


He Gin Val Ser 






20 








25 




30 


Leu 


Trp 


Cys 


Leu 


Ala 


Ala 


Gly 


Arg Gin He His 


Lys 


He Arg Gin Lys 






35 










40 




45 


Phe 


Phe 


His 


Ala 


He 


Met 


Asn 


Gin Glu He Gly 


Trp 


Phe Asp Val His 




50 










55 




60 




Asp 


Val 


Gly 


Glu 


Leu 


Asn 


Thr 


Arg Leu Thr Asp 


Asp 


Val Ser Lys He 


65 










70 




75 




80 


Asn 


Asp 


Gly 


He 


Gly 


Asp 


Lys 


He Gly Met Phe 


Phe 


Gin Ser He Thr 










85 






90 




95 


Thr 


Phe 


Leu 


Ala 


Gly 


Phe 


He 


He Gly Phe He 


Ser 


Gly Trp Lys Leu 








100 






105 




110 


Thr 


Leu 


Val 


He 


Leu 


Ala 


Val 


Ser Pro Leu He 


Gly 


Leu Ser Ser Ala 






115 










120 




125 


Leu 


Trp 


Ala 


Lys 


Val 


Leu 


Thr 


Ser Phe Thr Asn 


Lys 


Glu Leu Gin Ala 




130 










135 




140 




Tyr 


Ala 


Lys 


Ala 


Gly 


Ala 


Val 


Ala Glu Glu Val 


Leu 


Ala Ala He Arg 


145 










150 




155 




160 


Thr 


Val 


He 


Ala 


Phe 


Gly 


Gly 


Gin Gin Lys Glu 


Leu 


Glu Arg Tyr Asn 










165 






170 




175 


Lys 


Asn 


Leu 


Glu 


Glu 


Ala 


Lys 


Asn Val Gly He 


Lys 


Lys Ala He Thr 








180 








185 




190 


Ala 


Ser 


He 


Ser 


He 


Gly 


He 


Ala Tyr Leu Leu 


Val 


Tyr Ala Ser Tyr 






195 








200 




205 


Ala 


Leu 


Ala 


Phe 


Trp 


Tyr 


Gly 


Thr Ser Leu Val 


Leu 


Ser Asn Glu Tyr 




210 










215 




220 




Ser 


He 


Gly 


Glu 


Val 


Leu 


Thr 


Val Phe Phe Ser 


He 


Leu Leu Gly Thr 


225 








230 




235 




240 


Phe 


Ser 


He 


Gly 


His 


Leu 


Ala 


Pro Asn He Glu 


Ala 


Phe Ala Asn Ala 








245 






250 




255 


Arg 


Gly 


Ala 


Ala 


Phe 


Glu 


He 


Phe Lys He He 


Asp 


Asn Glu Pro Ser 








260 








265 




270 


lie 


Asp 


Ser 


Phe 


Ser 


Thr 


Lys 


Gly Tyr Lys Pro 


Asp 


Ser He Met Gly 






275 










280 




285 


Asn 


Leu 


Glu 


Phe 


Lys 


Asn 


Val 


His Phe Asn Tyr 


Pro 


Ser Arg Ser Glu 




290 










295 




300 




Val 


Gin 


He 


Leu 


Lys 


Gly 


Leu 


Asn Leu Lys Val 


Lys 


Ser Gly Gin Thr 


305 










310 




315 




320 


Val 


Ala 


Leu 


Val 


Gly 


Asn 


Ser 


Gly Cys Gly Lys 


Ser 


Thr Thr Val Gin 










325 






330 




335 


Leu 


Met 


Gin 


Arg 


Leu 


Tyr 


Asp 


Pro Leu Glu Gly 


Val 


Val Ser He Asp 








340 




345 




350 


Gly 


Gin 


Asp 


He 


Arg 


Thr 


He 


Asn Val Arg Tyr 


Leu 


Arg Glu He He 






355 










360 




365 


Gly 


Val 


Val 


Ser 


Gin 


Glu 


Pro 


Val Leu Phe Ala 


Thr 


Thr He Ala Glu 
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370 375 380 

Asn He Arg Tyr Gly Arg Glu Asp Val Thr Met Asp Glu He Glu Lys 
385 390 395 400 

Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe He Met Lys Leu Pro His 

405 410 415 

Gin Phe Asp Thr Leu Val Gly Glu Arg Gly Ala Gin Leu Ser Gly Gly 

420 425 430 

Gin Lys Gin Arg He Ala He Ala Arg Ala Leu Val Arg Asn Pro Lys 

435 440 445 

He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu 

450 455 460 

Ala Val Val Gin Ala Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Thr 
465 470 475 480 

He Val He Ala His Arg Leu Ser Thr Val Arg Asn Ala Asp Val He 

485 490 495 

Ala Gly Phe Asp Gly Gly Val He Val Glu Gin Gly Asn His Asp Glu 

500 505 510 

Leu Met Arg Glu Lys Gly He Tyr Phe Lys Leu Val Met Thr Gin Thr 

515 520 525 

Arg Gly Asn Glu lie Glu Pro Gly Asn Asn Ala Tyr Gly Ser Gin Ser 

530 535 540 

Asp Thr Asp Ala Ser Glu Leu Thr Ser Glu Glu Ser Lys Ser Pro Leu 
545 550 555 560 

He Arg Arg Ser He Tyr Arg Ser Val His Arg Lys Gin Asp Gin Glu 

565 570 575 

Arg Arg Leu Ser Met Lys Glu Ala Val Asp Glu Asp Val Pro Leu Val 

580 585 590 

Ser Phe Trp Arg He Leu Asn Leu Asn Leu Ser Glu Trp Pro Tyr Leu 

595 600 605 

Leu Val Gly Val Leu Cys Ala Val He Asn Gly Cys He Gin Pro Val 

610 615 620 

Phe Ala He Val Phe Ser Arg He Val Gly Val Phe Ser Arg Asp Asp 
625 630 635 640 

Asp His Glu Thr Lys Arg Gin Asn Cys Asn Leu Phe Ser Leu Phe Phe 

645 m . 650 655 

Leu Val Met Gly Leu He Ser Phe Val Thr Tyr Phe Phe Gin Gly Phe 

660 665 670 

Thr Phe Gly Lys Ala Gly Glu He Leu Thr Lys Arg Val Arg Tyr Met 

675 680 685 

Val Phe Lys Ser Met Leu Arg Gin Asp He Ser Trp Phe Asp Asp His 

690 695 700 

Lys Asn Ser Thr Gly Ser Leu Thr Thr Arg Leu Ala Ser Asp Ala Ser 
705 710 715 720 

Ser Val Lys Gly Ala Met Gly Ala Arg Leu Ala Val Val Thr Gin Asn 

725 730 735 

Val Ala Asn Leu Gly Thr Gly Val He Leu Ser Leu Val Tyr Gly Trp 

740 745 750 

Gin Leu Thr Leu Leu Leu Val Val He He Pro Leu He Val Leu Gly 

755 760 765 

Gly He He Glu Met Lys Leu Leu Ser Gly Gin Ala Leu Lys Asp Lys 

770 775 780 

Lys Gin Leu Glu He Ser Gly Lys He Ala Thr Glu Ala He Glu Asn 
785 790 795 800 

Phe Arg Thr He Val Ser Leu Thr Arg Glu Gin Lys Phe Glu Thr Met 

805 810 815 

Tyr Ala Gin Ser Leu Gin Val Pro Tyr Arg Asn Ala Met Lys Lys Ala 

820 825 830 

His Val Phe Gly He Thr Phe Ser Phe Thr Gin Ala Met Met Tyr Phe 

835 840 845 

Ser Tyr Ala Ala Cys Phe Arg Phe Gly Ala Tyr Leu Val Ala Gin Gin 

850 855 860 

Leu Met Thr Phe Glu Asn Val Met Leu Val Phe Ser Ala Val Val Phe 
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865 








870 








875 










880 


Gly 


Ala 


Met 


Ala Ala 


Gly 


Asn 


Thr Ser 


Ser 


Phe 


Ala 


Pro 


Asp 


Tyr 


Ala 






885 








890 










895 




Lys 


Ala 


Lys 


Val Ser 


Ala 


Ser 


His He 


He 


Arg 


He 


He 


Glu 


Lys 


Thr 






900 






905 










910 






Pro 


Glu 


He 


Asp Ser 


Tyr 


Ser 


Thr Glu 


Gly 


Leu 


Lys 


Pro 


Thr 


Leu 


Leu 






915 








920 








925 








Glu 


Gly 


Asn 


Val Lys 


Phe 


Asn 


Gly Val 


Gin 


Phe 


Asn 


Tyr 


Pro 


Thr 


Arg 




930 








935 








940 










Pro 


Asn 


lie 


Pro Val 


Leu 


Gin 


Gly Leu 


Ser 


Leu 


Glu 


Val 


Lys 


Lys 


Gly 


945 








950 








955 










960 


Gin 


Thr 


Leu 


Ala Leu 


Val 


Gly 


Ser Ser 


Gly 


Cys 


Gly 


Lys 


Ser 


Thr 


Val 








965 








970 










975 




Val 


Gin 


Leu 


Leu Glu 


Arg 


Phe 


Tyr Asp 


Pro 


Met 


Ala 


Gly 


Ser 


Val 


Phe 








980 






985 










990 






Leu 


Asp 


Gly 


Lys Glu 


He 


Lys 


Gin Leu 


Asn 


Val 


Gin 


Trp 


Leu 


Arg 


Ala 






995 








1000 








1005 






His 


Leu 


Gly 

) 


He Val 


Ser 


Gin 


Glu Pro 


He 


Leu 


Phe 


Asp Cys 


Ser 


He 




10K 






1015 






1020 








Ala 


Glu Asn 


He Ala 


Tyr 


Gly Asp Asn 


Ser 


Arg 


Ala 


Val 


Ser 


His 


Glu 


1025 






1030 






1035 








1040 


Glu 


lie 


Val 


Arg Ala 


Ala 


Lys 


Glu Ala 


Asn 


He 


His 


Gin 


Phe 


He 


Asp 








1045 






1050 








1055 


Ser 


Leu 


Pro 


Asp Lys Tyr Asn Thr Arg 


Val Gly Asp Lys 


Gly 


Thr 


Gin 








1060 






1065 








1070 




Leu Ser Gly 


Gly Gin Lys Gin Arg He Ala He Ala Arg Ala Leu Val 



1075 1080 1085 



Arg Gin Pro His He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp 

1090 1095 1100 

Thr Glu Ser Glu Lys Val Val Gin Glu Ala Leu Asp Lys Ala Arg Glu 
1105 1110 1115 1120 

Gly Arg Thr Cys He Val He Ala His Arg Leu Ser Thr He Gin Asn 

1125 1130 1135 

Ala Asp Leu He Val Val He Glu Asn Gly Lys Val Lys Glu His Gly 

1140 1145 1150 

Thr His Gin Gin Leu Leu Ala Gin Lys Gly He Tyr Phe Ser Met Val 

1155 1160 1165 

Gin Ala Gly Ala Lys Arg Ser 
1170 1175 

<210> 5 
<211> 1066 
<212> PRT 

<213> Homo sapiens 





<300> 












<308> 


Genbank 


P21439 








<309> 


1998-07- 


•15 








<400> 


5 








Trp 


Lys Leu 


Thr Leu 


Val He Met Ala He 


Ser Pro He Leu Gly 


Leu 


1 




5 


10 


15 




Ser 


Ala Ala 


Val Trp 


Ala Lys tie Leu Ser 


Ala Phe Ser Asp Lys 


Glu 






20 


25 


30 




Leu 


Ala Ala 


Tyr Ala 


Lys Ala Gly Ala Val 


Ala Glu Glu Ala Leu 


Gly 




35 




40 


45 




Ala 


He Arg 


Thr Val 


He Ala Phe Gly Gly 


Gin Asn Lys Glu Leu 


Glu 




50 




55 


60 




Arg 


Tyr Gin 


Lys His 


Leu Glu Asn Ala Lys 


Glu He Gly He Lys 


Lys 


65 






70 


75 


80 


Ala 


lie Ser 


Ala Asn 


lie Ser Met Gly He 


Ala Phe Leu Leu He 


Tyr 






85 


90 


95 
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Ala 


Ser 


Tyr Ala 






100 


Lys 


Glu 


Tyr Thr 






115 


He 


Gly 


Ala Phe 




130 




Ala 


Asn 


Ala Arg 


145 






Asn 


Pro 


Lys He 


He 


Lys 


Gly Asn 






180 


Arg 


Ala 


Asn Val 




195 


Gly 


Gin 


Thr Val 


210 




Thr 


Val 


Gin Leu 


225 






Asn 


He 


Asp Gly 


Glu 


He 


He Gly 






260 


He 


Ala 


Glu Asn 






275 


He 


Lvs 


Lys Ala 




290 




Leu 


Pro 


Gin Lys 


305 






Ser 


Gly 


Gly Gin 


Asn 


Pro 


Lys He 






340 


Glu 


Ser 


Glu Ala 






355 


Arq 


Thr 


Thr lie 




370 




Asp 


Val 


He Ala 


385 






His 


Ser 


Glu Leu 


Met 


Gin 


Thr Ser 






420 


Asp 


Glu 


Lys Ala 






435 


Leu 


Phe 


Arg His 




450 




Gin 


Lys 


Ser Leu 


465 






Pro 


Val 


Ser Phe 


Tyr 


Phe 


Val Val 






500 


Pro 


Ala 


Phe Ser 






515 


Gly 


Asp 


Asp Ala 




530 




Phe 


Leu 


Phe Leu 


545 






Phe 


Thr 


Phe Gly 


Met 


Ala 


Phe Lys 






580 



Leu Ala Phe Trp Tyr Gly Ser Thr 
105 

He Gly Asn Ala Met Thr Val Phe 
120 

Ser Val Gly Gin Ala Ala Pro Cys 
135 140 
Gly Ala Ala Tyr Val He Phe Asp 

150 155 
Asp Ser Phe Ser Glu Arg Gly His 
165 170 
Leu Glu Phe Asn Asp Val His Phe 
185 

Lys He Leu Lys Gly Leu Asn Leu 
200 

Ala Leu Val Gly Ser Ser Gly Cys 
215 220 
He Gin Arg Leu Tyr Asp Pro Asp 

230 235 
Gin Asp He Arg Asn Phe Asn Val 
245 250 
Val Val Ser Gin Glu Pro Val Leu 
265 

He Cys Tyr Gly Arg Gly Asn Val 
280 

Val Lys Glu Ala Asn Ala Tyr Glu 
295 300 
Phe Asp Thr Leu Val Gly Glu Arg 

310 315 
Lys Gin Arg He Ala He Ala Arg 
325 330 
Leu Leu Leu Asp Glu Ala Thr Ser 
345 

Glu Val Gin Ala Ala Leu Asp Lys 
360 

Val He Ala His Arg Leu Ser Thr 
375 380 
Gly Phe Glu Asp Gly Val He Val 

390 395 
Met Lys Lys Glu .Gly Val Tyr Phe 
405 410 
Gly Ser Gin He Gin Ser Glu Glu 
425 

Ala Thr Arg Met Ala Pro Asn Gly 
440 

Ser Thr Gin Lys Asn Leu Lys Asn 
455 460 
Asp Val Glu Thr Asp Gly Leu Glu 

470 475 
Leu Lys Val Leu Lys Leu Asn Lys 
485 490 
Gly Thr Val Cys Ala He Ala Asn 
505 

Val He Phe Ser Glu lie He Ala 
520 

Val Lys Gin Gin Lys Cys Asn He 
535 540 
Gly He He Ser Phe Phe Thr Phe 

550 555 
Lys Ala Gly Glu He Leu Thr Arg 
565 570 
Ala Met Leu Arg Gin Asp Met Ser 
585 



Leu Val He Ser 
110 

Phe Ser He Leu 
125 

He Asp Ala Phe 

He He Asp Asn 
160 

Lys Pro Asp Ser 
175 

Ser Tyr Pro Ser 
190 

Lys Val Gin Ser 
205 

Gly Lys Ser Thr 

Glu Gly Thr He 
240 

Asn Tyr Leu Arg 
255 

Phe Ser Thr Thr 
270 

Thr Met Asp Glu 
285 

Phe lie Met Lys 

Gly Ala Gin Leu 
320 

Ala Leu Val Arg 
335 

Ala Leu Asp Thr 
350 

Ala Arg Glu Gly 

365. 

Val Arg Asn Ala 

Glu Gin Gly Ser 
400 

Lys Leu Val Asn 
415 

Phe Glu Leu Asn 
430 

Trp Lys Ser Arg 
445 

Ser Gin Met Cys 

Ala Asn Val Pro 
480 

Thr Glu Trp Pro 
495 

Gly Gly Leu Gin 
510 

He Phe Gly Pro 
525 

Phe Ser Leu He 

Phe Leu Gin Gly 
560 

Arg Leu Arg Ser 
575 

Trp Phe Asp Asp 
590 
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His Lys Asn Ser Thr Gly Ala Leu Ser Thr Arg Leu Ala Thr Asp Ala 

595 600 605 

Ala Gin Val Gin Gly Ala Thr Gly Thr Arg Leu Ala Leu He Ala Gin 

610 615 620 

Asn He Ala Asn Leu Gly Thr Gly He He He Ser Phe He Tyr Gly 
625 630 635 640 

Trp Gin Leu Thr Leu Leu Leu Leu Ala Val Val Pro He He Ala Val 

645 650 655 

Ser Gly He Val Glu Met Lys Leu Leu Ala Gly Asn Ala Lys Arg Asp 

660 665 670 

Lys Lys Glu Leu Glu Ala Ala Gly Lys He Ala Thr Glu Ala He Glu 

675 680 685 

Asn He Arg Thr Val Val Ser Leu Thr Gin Glu Arg Lys Phe Glu Ser 

690 695 700 

Met Tyr Val Glu Lys Leu Tyr Gly Pro Tyr Arg Asn Ser Val Gin Lys 
705 710 715 720 

Ala His He Tyr Gly He Thr Phe Ser He Ser Gin Ala Phe Met Tyr 

725 730 735 

Phe Ser Tyr Ala Gly Cys Phe Arg Phe Gly Ala Tyr Leu He Val Asn 

740 745 750 

Gly His Met Arg Phe Arg Asp Val He Leu Val Phe Ser Ala He Val 

755 760 765 

Phe Gly Ala Val Ala Leu Gly His Ala Ser Ser Phe Ala Pro Asp Tyr 

770 775 780 

Ala Lys Ala Lys Leu Ser Ala Ala His Leu Phe Met Leu Phe Glu Arg 
785 " 790 795 800 

Gin Pro Leu He Asp Ser Tyr Ser Glu Glu Gly Leu Lys Pro Asp Lys 

805 810 815 

Phe Glu Gly Asn He Thr Phe Asn Glu Val Val Phe Asn Tyr Pro Thr 

820 825 830 

Arg Ala Asn Val Pro Val Leu Gin Gly Leu Ser Leu Glu Val Lys Lys 

835 840 845 

Gly Gin Thr Leu Ala Leu Val Gly Ser Ser Gly Cys Gly Lys Ser Thr 

850 855 860 

Val Val Gin Leu Leu Glu Arg Phe Tyr Asp Pro Leu Ala Gly Thr Val 
865 870 875 880 

Leu Leu Asp Gly Gin Glu Ala Lys Lys Leu Asn Val Gin Trp Leu Arg 

885 890 895 

Ala Gin Leu Gly He Val Ser Gin Glu Pro He Leu Phe Asp Cys Ser 

900 905 910 

He Ala Glu Asn He Ala Tyr Gly Asp Asn Ser Arg Val Val Ser Gin 

915 920 925 

Asp Glu lie Val Ser Ala Ala Lys Ala Ala Asn He His Pro Phe He 

930 935 940 

Glu Thr Leu Pro His Lys Tyr Glu Thr Arg Val Gly Asp Lys Gly Thr 
945 950 955 960 

Gin Leu Ser Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala Leu 

965 970 975 

He Arg Gin Pro Gin He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu 

980 985 990 

Asp Thr Glu Ser Glu Lys Val Val Gin Glu Ala Leu Asp Lys Ala Arg 

995 1000 1005 

Glu Gly Arg Thr Cys He Val He Ala His Arg Leu Ser Thr He Gin 

1010 1015 1020 

Asn Ala Asp Leu He Val Val Phe Gin Asn Gly Arg Val Lys Glu His 
1025 1030 1035 1040 

Gly Thr His Gin Gin Leu Leu Ala Gin Lys Gly He Tyr Phe Ser Met 

1045 1050 1055 

Val Ser Val Gin Ala Gly Thr Gin Asn Leu 
1060 1065 

<210> 6 
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<211> 1266 
<212> PRT 

<213> Mus mus cuius 
<300> 

<308> Genbank P21440 
<309> 1997-11-01 

<400> 6 

Trp Lys Leu Thr Leu Val lie Met Ala lie Ser Pro lie Leu Gly Leu 

1 5 10 15 

Ser Thr Ala Val Trp Ala Lys lie Leu Ser Thr Phe Ser Asp . Lys Glu 

20 25 30 

Leu Ala Ala Tyr Ala Lys Ala Gly Ala Val Ala Glu Glu Ala Pro Gly 

35 40 45 

Ala lie Arg Thr Val lie Ala Phe Gly Gly Gin Asn Lys Glu Leu Glu 

50 55 60 

Arg Tyr Gin Lys His Leu Glu Asn Ala Lys Lys lie Gly lie Lys Lys 
65 70 75 80 

Ala lie Ser Ala Asn lie Ser Met Gly lie Ala Phe Leu Leu lie Tyr 

85 90 95 

Ala Ser Tyr Ala Leu Ala Phe Trp Tyr Gly Ser Thr Leu Val lie Ser 

100 105 110 

Lys Glu Tyr Thr He Gly Asn Ala Met Thr Val Phe Phe Ser He Leu 

115 120 125 

He Gly Ala Phe Ser Val Gly Gin Ala Ala Pro Cys He Asp Ala Phe 

130 135 140 

Ala Asn Ala Arg Gly Ala Ala Tyr Val He Phe Asp He He Asp Asn 
145 150 155 160 

Asn Pro Lys He Asp Ser Phe Ser Glu Arg Gly His Lys Pro Asp Asn 

165 170 175 

He Lys Gly Asn Leu Glu Phe Ser Asp Val His Phe Ser Tyr Pro Ser 

180 185 190 

Arg Ala Asn He Lys He Leu Lys Gly Leu Asn Leu Lys Val Lys Ser 

195 200 205 

Gly Gin Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly Lys Ser Thr 

210 215 220 

Thr Val Gin Leu Leu Gin Arg Leu Tyr Asp Pro Thr Glu Gly Lys He 
225 230 235 240 

Ser He Asp Gly Gin Asp He Arg Asn Phe Asn Val Arg Cys Leu Arg 

245 250 255 

Glu He He Gly Val Val Ser Glh Glu Pro Val Leu Phe Ser Thr Thr 

260 265 270 

He Ala Glu Asn He Arg Tyr Gly Arg Gly Asn Val Thr Met Asp Glu 

275 280 285 

He Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe He Met Lys 

290 295 300 

Leu Pro Gin Lys Phe Asp Thr Leu Val Gly Asp Arg Gly Ala Gin Leu 
305 310 315 320 

Ser Gly Gly Gin Lys Gin Arg He Ala lie Ala Arg Ala Leu Val Arg 

325 330 335 

Asn Pro Lys He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr 

340 345 350 

Glu Ser Glu Ala Glu Val Gin Ala Ala Leu Asp Lys Ala Arg Glu Gly 

355 360 365 

Arg Thr Thr He Val He Ala His Arg Leu Ser Thr He Arg Asn Ala 

370 375 380 

Asp Val He Ala Gly Phe Glu Asp Gly Val He Val Glu Gin Gly Ser 
385 390 395 400 

His Ser Glu Leu Met Lys Lys Glu Gly He Tyr Phe Arg Leu Val Asn 

405 410 415 

Met Gin Thr Ala Gly Ser Gin He Leu Ser Glu Glu Phe Glu Ala Arg 
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420 



Ala 


Leu 


Val 


Arg Asn Pro Lys He 






435 


440 


Ala 


Leu 


Asp 


Thr Glu Ser Glu Ala 




450 




455 


Ala 


Arg 


Lys 


Gly Arg Thr Thr He 


465 






470 


Val 


Arg 


Asn 


Ala Asp Val He Ala 








485 


Glu 


Lys 


Gly 


Asn His Asp Glu Leu 








500 


Lys 


Leu 


Val 


Thr Met Gin Thr Ala 




515 


520 


Ala 


Ala 


Ala 


Arg Ala Leu Val Arg 




530 




535 


Glu 


Ala 


Thr 


Ser Ala Leu Asp Thr 


545 






550 


Ala 


Leu 


Asp 


Lys Ala Arg Glu Gly 








565 


Arg 


Leu 


Ser 


Thr Val Arg Asn Ala 








580 


Gly 


Val 


He 


Val Glu Gin Gly Asn 




595 


600 


Gly 


He 


Tyr 


Phe Lys Leu Val Met 




610 




615 


Glu 


Pro 


Gly 


Asn Asn Ala Val Glu 


625 






630 


Asp 


Val 


Ala 


Pro Asn Gly Trp Lys 








645 


Lys 


Lys 


Ser 


Leu Lys Ser Pro His 








660 


Asn 


Glu 


Leu 


Asp Ala Asn Val Pro 






675 


680 


Lys 


Leu 


Asn 


Lys Thr Glu Trp Pro 




690 




695 


Ala 


He 


Ala 


Asn Gly Ala Leu Gin 


705 






710 


Glu 


Met 


He 


Ala He Phe Gly Pro 








725 


Lys 


Cys 


Asn 


Met Phe Ser Leu Val 




740 


Phe 


Phe 


Thr 


Phe Phe Leu Gin Gly 






755 


760 


He 


Leu 


Thr 


Thr Arg Leu Arg Ser 




770 




775 


Gin 


Asp 


Met 


Ser Trp Phe Asp Asp 


785 






790 


Ser 


Thr 


Arg 


Leu Ala Thr Asp Ala 








805 


Thr 


Lys 


Leu 


Ala Leu He Ala Gin 








820 


He 


He 


He 


Ser Phe He Tyr Gly 






835 


840 


Ser 


Val 


Val 


Pro Phe He Ala Val 




850 




855 


Leu 


Ala 


Gly 


Asn Ala Lys Arg Asp 


865 






870 


Lys 


He 


Ala 


Thr Glu Ala He Glu 






885 


Thr 


Gin 


Glu 


Arg Lys Phe Glu Ser 








900 


Pro 


Tyr 


Arg 


Asn Ser Val Arg Lys 



14. 



425 




430 






Leu 


Leu Leu 


Asp Glu Ala 


Thr 


Ser 






445 






Val 


Val Gin 


Val Ala Leu 


Asp 


Lys 






460 






Val 


He Ala 


His Arg Leu 


Ser 


Thr 




475 






480 


Gly 


Phe Asp 


Asp Gly Val 


He 


Val 




490 




495 




Met 


Lys Glu 


Lys Gly He 


Tyr 


Phe 


505 




510 






Gly 


Asn Glu 


Val Glu Leu 


Glu 


Asn 




525 






Asn 


Pro Lys 


He Leu Leu 


Leu 


Asp 




540 






Glu 


Ser Glu 


Ala Val Val 


Gin 


Ala 




555 






560 


Arg 


Thr Thr 


He Val He 


Ala 


His 




570 




575 




Asp 


Val He 


Ala Gly Phe 


Asp 


Gly 


585 




590 






His 


Asp Glu 


Leu Met Arg 


Glu 


Lys 




605 






Thr 


Gin Thr 


Arg Gly Asn 


Glu 


He 






620 






Leu 


Ser Asp 


Glu Lys Ala 


Ala 


Gly 




635 






640 


Ala 


Arg He 


Phe Arg Asn 


Ser 


Thr 




650 




655 




Gin 


Asn Arg 


Leu Asp Glu 


Glu 


Thr 


665 




670 






Pro 


Val Ser 


Phe Leu Lys 


Val 


Leu 






685 






Tyr 


Phe Val 


Val Gly Thr 


Val 


Cys 






700 






Pro 


Ala Phe 


Ser lie He 


Leu 


Ser 




715 






720 


Gly 


Asp Asp 


Ala Val Lys 


Gin 


Gin 




730 




735 




Phe 


Leu Gly 


Leu Gly Val 


Leu 


Ser 


745 


750 






Phe 


Thr Phe 


Gly Lys Ala 


Gly 


Glu 






765 






Met 


Ala Phe 


Lys Ala Met 


Leu 


Arg 






780 






His 


Lys Asn 


Ser Thr Gly 


Ala 


Leu 




795 






800 


Ala 


Gin Val 


Gin Gly Ala 


Thr 


Gly 




810 




815 




Asn 


Thr Ala 


Asn Leu Gly 


Thr 


Gly 


825 




830 






Trp 


Gin Leu 


Thr Leu Leu 


Leu 


Leu 




845 






Ala 


Gly He 


Val Glu Met 


Lys 


Met 




860 






Lys 


Lys Glu 


Met Glu Ala 


Ala 


Gly 




875 






880 


Asn 


He Arg 


Thr Val Val 


Ser 


Leu 




890 




895 




Met 


Tyr Val 


Glu Lys Leu 


His 


Gly 


905 




910 






Ala 


His lie 


Tyr Gly He 


Thr 


Phe 
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915 920 925 

Ser lie Ser Gin Ala Phe Met Tyr Phe Ser Tyr Ala Gly Cys Phe Arg 

930 935 940 

Phe Gly Ser Tyr Leu lie Val Asn Gly His Met Arg Phe Lys Asp Val 
945 950 955 960 

He Leu Val Phe Ser Ala He Val Leu Gly Ala Val Ala Leu Gly His 

965 970 975 

Ala Ser Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys Leu Ser Ala Ala 

980 985 990 

Tyr Leu Phe Ser Leu Phe Glu Arg Gin Pro Leu He Asp Ser Tyr Ser 

995 1000 1005 

Gly Glu Gly Leu Trp Pro Asp Lys Phe Glu Gly Ser Val Thr Phe Asn 

1010 1015 1020 

Glu Val Val Phe Asn Tyr Pro Thr Arg Ala Asn Val Pro Val Leu Gin 
1025 1030 1035 1040 

Gly Leu Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala Leu Val Gly 

1045 " 1050 1055 

Ser Ser Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe 

1060 1065 1070 

Tyr Asp Pro Met Ala Gly Ser Val Leu Leu Asp Gly Gin Glu Ala Lys 

1075 1080 1085 

Lys Leu Asn Val Gin Trp Leu Arg Ala Gin Leu Gly He Val Ser Gin 

1090 1095 1100 

Glu Pro He Leu Phe Asp Cys Ser He Ala Glu Asn He Ala Tyr Gly 
1105 1110 1115 1120 

Asp Asn Ser Arg Val Val Pro His Asp Glu He Val Arg Ala Ala Lys 

1125 1130 1135 

Glu Ala Asn He His Pro Phe He Glu Thr Leu Pro Gin Lys Tyr Asn 

1140 1145 1150 

Thr Arg Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin 

1155 1160 1165 

Arg lie Ala He Ala Arg Ala Leu He Arg Gin Pro Arg Val Leu Leu 

1170 1175 1180 

Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val 
1185 1190 1195 1200 

Gin Glu Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He 

1205 1210 1215 

Ala His Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val He 

1220 1225 1230 

Glu Asn Gly Lys Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala 

1235 1240 1245 

Gin Lys Gly He Tyr Phe Ser Met Val Asn He Gin Ala Gly Thr Gin 

1250 1255 1260 

Asn Leu 
1265 

<210> 7 
<211> 1207 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank A42150 
<309> 1997-03-13 

<400> 7 

Glu Lys Met Met Glu Glu Val Leu Lys Tyr Ala Leu Tyr Phe Leu Val 

1 5 10 15 

Val Gly Ala Ala He Trp Ala Ser Ser Trp Ala Glu He Ser Cys Trp 

20 ' 25 30 

Met Trp Ser Gly Glu Arg Gin Thr Thr Lys Met Arg He Lys Tyr Leu 
35 40 45 
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Glu Ala Ala Leu Asn Gin Asp He Gin Phe Phe Asp Thr Glu Val Arg 

50 55 60 

Thr Ser Asp Val Val Phe Ala He Asn Thr Asp Ala Val Met Val Gin 
65 70 75 80 

Asp Ala He Ser Glu Lys Leu Gly Asn Phe He His Tyr Met Ala Thr 

85 90 95 

Phe Val Ser Gly Phe He Val Gly Phe Thr Ala Val Trp Gin Leu Ala 

100 105 HO 

Leu Val Thr Leu Ala Val Val Pro Leu He Ala Val He Gly Gly He 

115 120 125 

His Thr Thr Thr Leu Ser Lys Leu Ser Asn Lys Ser Gin Glu Ser Leu 

130 135 140 

Ser Gin Ala Gly Asn He Val Glu Gin Thr Val Val Gin He Arg Val 
145 ~ 150 155 160 

Val Met Ala Phe Val Gly Glu Ser Arg Ala Ser Gin Ala Tyr Ser Ser 

165 170 175 

Ala Leu Lys He Ala Gin Lys Leu Gly Tyr Lys Thr Gly Leu Ala Lys 

180 185 190 

Gly Met Gly Leu Gly Ala Thr Tyr Phe Val Val Phe Cys Cys Tyr Ala 

195 200 205 

Leu Leu Leu Trp Tyr Gly Gly Tyr Leu Val Arg His His Leu Thr Asn 

210 215 220 

Gly Gly Leu Ala He Ala Thr Met Phe Ala Val Met He Gly Gly Leu 
225 230 235 240 

Ala Leu Gly Gin Ser Ala Pro Ser Met Ala Ala Phe Ala Lys Ala Lys 

245 250 255 

Val Ala Ala Ala Lys He Phe Arg He He Asp His Lys Pro Thr He 

260 265 270 

Glu Arg Asn Ser Glu Ser Gly Val Glu Leu Asp Ser Val Thr Gly Leu 

275 280 285 

Val Glu Leu Lys Asn Val Asp Phe Ser Tyr Pro Ser Arg Pro Asp Val 

290 295 300 

Lys He Leu Asn Asn Phe Cys Leu Ser Val Pro Ala Gly Lys Thr He 
305 310 315 320 

Ala Leu Val Gly Ser Ser Gly Ser Gly Lys Ser Thr Val Val Ser Leu 

325 330 335 

He Glu Arg Phe Tyr Asp Pro Asn Ser Gly Gin Val Leu Leu Asp Gly 

340 345 350 

Gin Asp Leu Lys Thr Leu Lys Leu Arg Trp Leu Arg Gin Gin He Gly 

355 360 365 

Leu Val Ser Gin Glu Pro Ala Leu Phe Ala Thr Ser He Lys Glu Asn 

370 375 380 

He Leu Leu Gly Arg Pro Asp Ala Asp Gin Val Glu He Glu Glu Ala 
385 " 390 395 400 

Ala Arg Val Ala Asn Ala His Ser Phe He He Lys Leu Pro Asp Gly 

405 410 415 

Phe Asp Thr Gin Val Gly Glu Arg Gly Leu Gin Leu Ser Gly Gly Gin 

420 425 430 

Lys Gin Arg He Ala He Ala Arg Ala Met Leu Lys Asn Pro Ala lie 

435 440 445 

Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ser Glu Ser Glu Lys 

450 * 455 460 

Leu Val Gin Glu Ala Leu Asp Arg Phe Met He Gly Arg Thr Thr Leu 
465 470 475 480 

He He Ala His Arg Leu Ser Thr He Arg Lys Ala Asp Leu Val Ala 

485 490 495 

Val Leu Gin Gin Gly Ser Val Ser Glu He Gly Thr His Asp Glu Leu 

500 505 510 

Phe Ser Lys Gly Glu Asn Gly Val Tyr Ala Lys Leu He Lys Met Gin. 

515 " 520 525 

Glu Ala Ala His Glu Thr Ala Met Ser Asn Ala Arg Lys Ser Ser Ala 
530 535 540 
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Arg Pro Ser Ser Ala Arg Asn Ser Val Ser Ser Pro He Met Thr Arg 
545 550 555 560 

Asn Ser Ser Tyr Gly Arg Ser Pro Tyr Ser Arg Arg Leu Ser Asp Phe 

565 570 575 

Ser Thr Ser Asp Phe Ser Leu Ser He Asp Ala Ser Ser Tyr Pro Asn 

580 585 590 

Tyr Arg Asn Glu Lys Leu Ala Phe Lys Asp Gin Ala Asn Ser Phe Trp 

595 600 605 

Arg Leu Ala Lys Met Asn Ser Pro Glu Trp Lys Tyr Ala Leu Leu Gly 

610 615 620 

Ser Val Gly Ser Val He Cys Gly Ser Leu Ser Ala Phe Phe Ala Tyr 
625 630 635 640 

Val Leu Ser Ala Val Leu Ser Val Tyr Tyr Asn Pro. Asp His Glu Tyr 

645 650 655 

Met He Lys Gin lie Asp Lys Tyr Cys Tyr Leu Leu He Gly Leu Ser 

660 665 670 

Ser Ala Ala Leu Val Phe Asn Thr Leu Gin His Ser Phe Trp Asp He 

675 680 685 

Val Gly Glu Asn Leu Thr Lys Arg Val Arg Glu Lys Met Leu Ser Ala 

690 695 700 

Val Leu Lys Asn Glu Met Ala Trp Phe Asp Gin Glu Glu Asn Glu Ser 
705 ' 710 715 720 

Ala Arg He Ala Ala Arg Leu Ala Leu Asp Ala Asn Asn Val Arg Ser 

725 730 735 

Ala He Gly Asp Arg He Ser Val He Val Gin Asn Thr Ala Leu Met 

740 745 750 

Leu Val Ala Cys Thr Ala Gly Phe Val Leu Gin Trp Arg Leu Ala Leu 

755 760 765 

Val Leu Val Ala Val Phe Pro Val Val Val Ala Ala Thr Val Leu Gin 

770 775 780 

Lys Met Phe Met Thr Gly Phe Ser Gly Asp Leu Glu Ala Ala His Ala 
785 790 795 800 

Lys Gly Thr Gin Leu Ala Gly Glu Ala He Ala Asn Val Arg Thr Val 

805 810 815 

Ala Ala Phe Asn Ser Glu Ala Lys He Val Arg Leu Tyr Thr Ala Asn 

820 825 830 

Leu Glu Pro Pro Leu Lys Arg Cys Phe Trp Lys Gly Gin He Ala Gly 

835 840 845 

Ser Gly Tyr Gly Val Ala Gin Phe Cys Leu Tyr Ala Ser Tyr Ala Leu 

850 855 860 

Gly Leu Trp Tyr Ala Ser Trp Leu Val Lys His Gly He Ser Asp Phe 
865 870 875 880 

Ser Lys Thr lie Arg Val Phe Met Val Leu Met Val Ser Ala Asn Gly 

885 890 895 

Ala Ala Glu Thr Leu Thr Leu Ala Pro Asp Phe He Lys Gly Gly Gin 

900 905 910 

Ala Met Arg Ser Val Phe Glu Leu Leu Asp Arg Lys Thr Glu He Glu 

915 920 925 

Pro Asp Asp Pro Asp Thr Thr Pro Val Pro Asp Arg Leu Arg Gly Glu 

930 935 940 

Val Glu Leu Lys His He Asp Phe Ser Tyr Pro Ser Arg Pro Asp lie 
945 950 955 960 

Gin He Phe Arg Asp Leu Ser Leu Arg Ala Arg Ala Gly Lys Thr Leu 

965 970 975 

Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Ser Ser Val He Ser Leu 

980 985 990 

lie Gin Arg Phe Tyr Glu Pro Ser Ser Gly Arg Val Met lie Asp Gly 

995 1000 1005 

Lys Asp lie Arg Lys Tyr Asn Leu Lys Ala He Arg Lys His He Ala 

1010 1015 1020 

He Val Pro Gin Glu Pro Cys Leu Phe Gly Thr Thr He Tyr Glu Asn 
1025 1030 1035 1Q40 
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He Ala Tyr Gly His Glu Cys Ala Thr Glu Ala Glu He He Gin Ala 

1045 1050 1055 

Ala Thr Leu Ala Ser Ala His Lys Phe He Ser Ala. Leu Pro Glu Gly 

1060 1065 1070 

Tyr Lys Thr Tyr Val Gly Glu Arg Gly Val Gin Leu Ser Gly Gly Gin 

1075 ~ 1080 1085 

Lys Gin Arg lie Ala He Ala Arg Ala Leu Val Arg Lys Ala Glu He 

1090 1095 1100 

Met Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu Ser Glu Arg 
1105 1110 1115 1120 

Ser Val Gin Glu Ala Leu Asp Gin Ala Cys Ser Gly Arg Thr Ser He 

1125 1130 1135 

Val Val Ala His Arg Leu Ser Thr He Arg Asn Ala His Val He Ala 

1140 1145 1150 

Val He Asp Asp Gly Lys Val Ala Glu Gin Gly Ser His Ser His Leu 

1155 1160 1165 

Leu Lys Asn His Pro Asp Gly He Tyr Ala Arg Met He Gin Leu Gin 

1170 1175 1180 

Arg Phe Thr His Thr Gin Val He Gly Met Thr Ser Gly Ser Ser Ser 
1185 1190 1195 1200 

Arg Val Lys Glu Asp Asp Ala 
1205 

<210> 8 
<211> 1161 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank CAA71277 
<309> 1997-05-19 

<400> 8 

Lys Gin Ala Ser His Arg Val Ala Lys Tyr Ser Leu Asp Phe Val Tyr 

1 5 10 15 

Leu Ser Val Ala He Leu Phe Ser Ser Trp Leu Glu Val Ala Cys Trp 

20 25 30 

Met His Thr Gly Glu Arg Gin Ala Ala Lys Met Arg Arg Ala Tyr Leu 

35 .40 45 

Arg Ser Met Leu Ser Gin Asp He Ser Leu Phe Asp Thr Glu Ala Ser 

50 55 60 

Thr Gly Glu Val He Ser Ala lie Thr Ser Asp He Leu Val Val Gin 
65 70 75 80 

Asp Ala Leu Ser Glu Lys Val Gly Asn Phe Leu His Tyr He Ser Arg 

85 90 95 

Phe He Ala Gly Phe Ala He Gly Phe Thr Ser Val Trp Gin He Ser 

100 105 HO 

Leu Val Thr Leu Ser He Val Pro Leu He Ala Leu Ala Gly Gly He 

115 120 125 

Tyr Ala Phe Val Ala He Gly Leu He Ala Arg Val Arg Lys Ser Tyr 

130 135 140 

He Lys Ala Gly Glu He Ala Glu Glu Val He Gly Asn Val Arg Thr 
145 150 155 160 

Val Gin Ala Phe Thr Gly Glu Glu Arg Ala Val Arg Leu Tyr Arg Glu 

165 170 175 

Ala Leu Glu Asn Thr Tyr Lys Tyr Gly Arg Lys Ala Gly Leu Thr Lys 

180 185 190 

Gly Leu Gly Leu Gly Ser Met His Cys Val Leu Phe Leu Ser Trp Ala 

195 200 205 

Leu Leu Val Trp Phe Thr Ser Val Val Val His Lys Asp He Ala Asp 

210 215 220 

Gly Gly Lys Ser Phe Thr Thr Met Leu Asn Val Val He Ala Gly Leu 
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225 230 235 240 

Ser Leu Gly Gin Ala Ala Pro Asp lie Ser Ala Phe Val Arg Ala Lys 

245 250 255 

Ala Ala Ala Tyr Pro He Phe Lys Met He Glu Arg Asn Thr Val Thr 

260 265 270 

Lys Thr Ser Ala Lys Ser Gly Arg Lys Leu Gly Lys Val Asp Gly His 

275 280 285 

He Gin Phe Lys Asp Ala Thr Phe Ser Tyr Pro Ser Arg Pro Asp Val 

290 295 300 

Val He Phe Asp Arg Leu Asn Leu Ala He Pro Ala Gly Lys He Val 
305 310 315 320 

Ala Leu Val Gly Gly Ser Gly Ser Gly Lys Ser Thr Val He Ser Leu 

325 330 335 

He Glu Arg Phe Tyr Glu Pro He Ser Gly Ala Val Leu Leu Asp Gly 

340 345 350 

Asn Asn He Ser Glu Leu Asp He Lys Trp Leu Arg Gly Gin He Gly 

355 360 365 

Leu Val Asn Gin Glu Pro Ala Leu Phe Ala Thr Thr He Arg Glu Asn 

370 375 380 

He Leu Tyr Gly Lys Asp Asp Ala Thr Ala Glu Glu He Thr Arg Ala 
385 390 395 400 

Ala Lys Leu Ser Glu Ala He Ser Phe He Asn Asn Leu Pro Glu Gly 

405 410 415 

Phe Glu Thr Gin Val Gly Glu Arg Gly He Gin Leu Ser Gly Gly Gin 

420 425 430 

Lys Gin Arg He Ala He Ser Arg Ala He Val Lys Asn Pro Ser He 

435 440 445 

Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu Ser Glu Lys 

450 455 460 

Ser Val Gin Glu Ala Leu Asp Arg Val Met Val Gly Arg Thr Thr Val 
465 470 475 480 

Val Val Ala His Arg Leu Ser Thr Val Arg Asn Ala Asp He He Ala 

485 490 495 

Val Val His Glu Gly Lys He Val Glu Phe Gly Asn His Glu Asn Leu 

500 ~ 505 510 

He Ser Asn Pro Asp Gly Ala Tyr Ser Ser Leu Leu Arg Leu Gin Glu 

515 520 525 

Thr Ala Ser Leu Gin Arg Asn Pro Ser Leu Asn Arg Thr Leu Ser Arg 

530 535 540 

Pro His Ser He Lys Tyr Ser Arg Glu Leu Ser Arg Thr Arg Ser Ser 
545 550 555 560 

Phe Cys Ser Glu Arg Glu Ser Val Thr Arg Pro Asp Gly Ala Asp Pro 

565 570 575 

Ser Lys Lys Val Lys Val Thr Val Gly Arg Leu Tyr Ser Met He Arg 

580 585 590 

Pro Asp Trp Met Tyr Gly Val Cys Gly Thr He Cys Ala Phe He Ala 

595 600 605 

Gly Ser Gin Met Pro Leu Phe Ala Leu Gly Val Ser Gin Ala Leu Val 

610 615 620 

Ser Tyr Tyr Ser Gly Trp Asp Glu Thr Gin Lys Glu lie Lys Lys He 
625 630 635 640 

Ala He Leu Phe Cys Cys Ala Ser Val He Thr Leu He Val Tyr Thr 

645 650 655 

He Glu His He Cys Phe Gly Thr Met Gly Glu Arg Leu Thr Leu Arg 

660 665 670 

Val Arg Glu Asn Met Phe Arg Ala He Leu Lys Asn Glu He Gly Trp 

675 680 685 

Phe Asp Glu Val Asp Asn Thr Ser Ser Met Leu Ala Ser Arg Leu Glu 

690 ' 695 700 

Ser Asp Ala Thr Leu Leu Lys Thr He Val Val Asp Arg Ser Thr He 
705 710 715 720 

Leu Leu Gin Asn Leu Gly Leu Val Val Thr Ser Phe He He Ala Phe 
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725 730 735 

lie Leu Asn Trp Arg Leu Thr Leu Val Val Leu Ala Thr Tyr Pro Leu 

740 ' 745 750 

Val lie Ser Gly His lie Ser Glu Lys Leu Phe Met Gin Gly Tyr Gly 

755 . 760 765 

Gly Asp Leu Asn Lys Ala Tyr Leu Lys Ala Asn Met Leu Ala Gly Glu 

770 . 775 780 

Ser Val Ser Asn He Arg Thr Val Ala Ala Phe Cys Ala Glu Glu Lys 
785 790 795 800 

He Leu Glu Leu Tyr Ser Arg Glu Leu Leu Glu Pro Ser Lys Ser Ser 

805 810 815 

Phe Arg Arg Gly Gin He Ala Gly Leu Phe Tyr Gly Val Ser Gin Phe 

820 825 830 

Phe He Phe Ser Ser Tyr Gly Leu Ala Leu Trp Tyr Gly Ser Thr Leu 

835 840 845 

Met Asp Lys Gly Leu Ala Gly Phe Lys Ser Val Met Lys Thr Phe Met 

850 855 860 

Val Leu lie. Val Thr Ala Leu Ala Met Gly Glu Thr Leu Ala Leu Ala 
865 870 875 880 

Pro Asp Leu Leu Lys Gly Asn Gin Met Val Ala Ser Val Phe Glu He 

885 890 895 

Leu Asp Arg Lys Thr Gin He Val Gly Glu Thr Ser Glu Glu Leu Asn 

900 905 910 

Asn Val Glu Gly Thr He Glu Leu Lys Gly Val His Phe Ser Tyr Pro 

915 920 925 

Ser Arg Pro Asp Val Val He Phe Arg Asp Phe Asp Leu He Val Arg 

930 935 940 

Ala Gly Lys Ser Met Ala Leu Val Gly Gin Ser Gly Ser Gly Lys Ser 
945 950 955 960 

Ser Val He Ser Leu He Leu Arg Phe Tyr Asp Pro Thr Ala Gly Lys 

965 970 975 

Val Met lie Glu Gly Lys Asp He Lys Lys Leu Asp Leu Lys Ala Leu 

980 985 990 

Arg Lys His He Gly Leu Val Gin Gin Glu Pro Ala Leu Phe Ala Thr 

995 1000 1005 

Thr He Tyr Glu Asn lie Leu Tyr Gly Asn Glu Gly Ala Ser Gin Ser 

1010 1015 1020 

Glu Val Val Glu Ser Ala Met Leu Ala Asn Ala His Ser Phe He Thr 
1025 1030 1035 1040 

Ser Leu Pro Glu Gly Tyr Ser Thr Lys Val Gly Glu Arg Gly Val Gin 

1045 1050 1055 

Met Ser Gly Gly Gin Arg Gin Arg He Ala He Ala Arg Ala He Leu 

1060 1065 1070 

Lys Asn Pro Ala He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp 

1075 1080 1085 

Val Glu Ser Glu Arg Val Val Gin Gin Ala Leu Asp Arg Leu Met Ala 

1090 1095 1100 

Asn Arg Thr Thr Val Val Val Ala His Arg Leu Ser Thr He Lys Asn 
1105 1110 1115 1120 

Ala Asp Thr He Ser Val Leu His Gly Gly Lys He Val Glu Gin Gly 

1125 1130 1135 

Ser His Arg Lys Leu Val Leu Asn Lys Ser Gly Pro Tyr Phe Lys Leu 

1140 1145 1150 

He Ser Leu Gin Gin Gin Gin Gin Pro 
1155 1160 

<210> 9 
<211> 986 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> synthetic consensus sequence 





<400> 


9 












Met Asp 


Glu 


Gly 


Ala 


Leu Ser Asp 


Arg Lys Lys Lys Val 


Gly Val Leu 


1 








5 






10 


15 


Phe Arg 


Tyr 


Ala 


Asp 


Trp Asp Lys 


Leu Met Leu Gly Thr 


Leu Ala Ala 








20 








25 


30 


He 


He 


His 


Gly 


Ser 


Leu Pro 


Leu 


Met Met He Val Phe 


Gly Glu Met 






35 






40 


45 




Thr Asp 


Phe 


Ala 


Ser 


Lys Leu Glu 


Glu Glu Met Thr Arg 


Tyr Ala Tyr 




50 








55 




60 




Tyr Tyr 


Ser 


Giy 


Leu 


Gly Ala Gly 


Val Leu Val Ala Tyr 


He Gin Val 


65 










70 




75 


80 


Ser 


Trp 


Leu 


Ala 


Ala 


Gly Arg Gin 


He Arg Lys He Arg 


Lys Phe Phe 








85 






90 


95 


His 


Ala 


He 


Leu 


Ara 


Gin Glu 


lie 


Gly Trp Phe Asp He 


Thr Gly Glu 








100 








105 


110 


Leu 


Asn 


Thr 


Ara 


Leu 


Thr Asp Asp 


He Ser Lys He Asn 


Asp Gly He 






115 








120 


125 




Gly Asp 


Lvs 


Val 


Glv 
j 


Met Phe 


Phe 


Gin Val Ala Thr Phe 


Leu Ala Gly 




130 








135 




140 




fne 


r^ e 

lie 


Val 


Glv 


Phe 


He Gly 


Tm 


Lys Leu Thr Leu Val 


He Leu Ala 


1*1 J 








150 




155 


160 


lie 




Pro 


He 


lie 


fZ 1 vr T.en 


Ser 


Ala Ala Val Trp Ala Lys He Leu 










165 




170 


175 


Car 




Ser 


Lys 


Glu 


XiCU Alu 




Ala Lys Ala Gly Ala Val Ala Glu 








180 








185 


190 


Glu 


Leu 


Gly 


Ala 


He 


Arg Thr 


Val 


He Ala Phe Gly Gly Gin Lys Glu 






195 








200 


205 




Leu 


Glu 


Arq 


Tvr 


Gin 


Lys Leu 


Glu 


Ala Lys Lys He Gly He Lys Lys 




210 








215 




220 




Ala 


He 


Ser 


Ala 


He 


Ser Met 


Glv 


Ala Phe Leu Leu lie Tyr Ala Ser 


225 










230 


235 


240 


Tyr 


Ala 


Leu 


Ala 


Phe 


T r*D Tvr 


Glv 


Ser Thr Leu Val He Ser Glu Tyr 








245 






250 


255 


Thr 


lie 


Gly 


Ala 


Met 


Thr Val 


Phe 


Phe Ser He Leu He Gly Ala Phe 






260 








265 


270 


Ser 


Val 


Gly 


Gin 


Ala 


Ala Pro 


He 


Asp Ala Phe Ala Asn Ala Arg Gly 






275 








280 


285 




Ala 


Ala 


Tyr 


He 


Phe 


Lys He 


He 


Asp Asn Pro Ser lie Asp Ser Phe 




290 






295 




300 




Ser 


Gly 


His 


Lvs 


Pro 


Asp He 


Lys 


Gly Asn Leu Glu Phe Lys Asp Val 


305 










310 




315 


320 


His 


Phe 


Ser 


Tvr 


Pro 


Ser Arg 


Glu 


Val Lys lie Leu Lys Gly Leu Asn 










325 




330 


335 


Leu 


Lys 


Val 


Ser 


Gly 

Jr 


Gin Thr 


Val 


Ala Leu Val Gly Ser Gly Cys Gly 






340 






345 


350 


Lys 


Ser 


Thr 


Thr 


Val 


Gin Leu 


lie 


Gin Arg Leu Tyr Asp 


Pro Glu Gly 






355 








360 


365 




Val 


He 


Asp 


Glv 


Gin 


Asp He 


Arg 


Thr lie Asn Val Arg Tyr Leu Arg 




370 








375 




380 




Glu 


He 


He 


Gly 


Val 


Val Ser 


Gin 


Glu Pro Val Leu Phe 


Ala Thr Thr 


385 








390 




395 


400 


He 


Ala 


Glu 


Asn 


He 


Tyr Gly 


Arg 


Asp Val Thr Met Asp Glu He Glu 










405 


410 


415 


Lys 


Ala 


Val 


Lys 


Glu 


Ala Asn 


Ala 


Tyr Glu Phe He Met Lys Leu Pro 






420 








425 


430 


Phe 


Asp 


Thr 


Leu 


Val 


Gly Glu 


Arg 


Gly Ala Gin Leu Ser Gly Gly Gin 






435 








440 


445 




Lys 


Gin 


Arg 


He 


Ala 


He Ala 


Arg 


Ala Leu Val Arg Asn Pro Lys He 


450 








455 




460 




Leu 


Leu 


Leu 


Asp 


Glu 


Ala Thr 


Ser 


Ala Leu Asp Thr Glu Ser Glu Ala 
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465 470 475 480 

Val Val Gin Ala Leu Asp Lys Ala Arg Gly Arg Thr Thr He Val He 

485 490 495 

Ala His Arg Leu Ser Thr Val Arg Asn Ala Asp Val He Ala Gly Phe 

500 505 510 

Glu Asp Gly Val He Val Glu Gly Ser His Asp Glu Leu Met Lys Lys 

515 520 525 

Gly Val Tyr Phe Lys Leu Val Met Gin Thr Gly He Asn Ser Glu Ala 

530 535 540 

Met Lys Ser Leu Arg Ser Ser Gin Asp Arg Asp Asp Leu Glu Val Pro 
545 550 555 560 

Val Ser Phe Trp Arg Val Leu Lys Leu Asn Thr Glu Trp Pro Tyr Val 

565 570 575 

Val Gly Thr Val Cys Ala He lie Asn Gly Leu Gin Pro Phe Ala He 

580 585 590 

He Leu Ser He He Ala Val Phe Asp Asp Val Lys Leu Leu Leu Leu 

595 600 605 

Ala Val Val Pro He lie Val Val Ala Gly He Val Glu Met Lys Leu 

610 615 620 

Leu Gly Asn Ala Arg Asp Lys Lys Leu Glu Ala Gly Lys He Ala Thr 
625 630 635 640 

Glu Ala He Glu Asn He Arg Thr Val Val Ser Leu Thr Glu Lys Phe 

645 650 655 

Glu Ser Met Tyr Leu Pro Tyr Arg Asn Ser Val Arg Lys Ala His He 

660 665 670 

Tyr Gly He Thr Phe Ser He Ser Gin Ala Met Tyr Phe Ser Tyr Ala 

675 680 685 

Gly Cys Phe Arg Phe Gly Ala Tyr Leu Val His Gly Leu Met Phe Val 

690 695 700 

He Leu Val Phe Ser Ala He Val Leu Gly Ala Val Ala Leu Gly Thr 
705 710 715 720 

Ser Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys Leu Ser Ala Ala Leu 

725 730 735 

Phe Leu lie Glu Arg Pro He Asp Ser Tyr Ser Glu Gly Leu Pro Asp 

740 745 750 

Leu Glu Gly Val Phe Val Phe Asn Tyr Pro Thr Arg Pro Asp Val Pro 

755 760 765 

Val Leu Gin Gly Leu Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala 

770 775 780 

Leu Val Gly Ser Ser Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu 
785 790 795 800 

Glu Arg Phe Tyr Asp Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys 

805 810 815 

Glu He Lys I<ys Leu Asn Val Gin Trp Leu Arg Ala His Leu Gly He 

820 825 830 

Val Ser Gin Glu Pro He Leu Phe Asp Cys Ser He Ala Glu Asn He 

835 840 845 

Ala Tyr Gly Asp Asn Ser Arg Val Ser Asp Glu He Val Ala Ala Lys 

850 855 860 

Ala Asn He His Phe He Glu Thr Leu Pro Asp Lys Tyr Thr Arg Val 
865 870 875 880 

Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin Arg He Ala 

885 890 895 

Ala He Ala Arg Ala Leu He Arg Gin Pro He Leu Leu Leu Asp Glu 

900 905 910 

Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gin Glu Ala 

915 920 925 

Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He Ala His Arg 

930 935 940 

Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val lie Asn Gly Lys 
945 950 955 960 

Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala Gin Lys Gly He 
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965 970 
Met Val Val Gin Ala Gly Thr 
980 985 



975 



Tyr Phe Ser 



<210> 10 
<211> 14000 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature 
<222> (3429) . . . (0) 
<223> Translation start codon 

<300> 

<308> Genbank AP000386 
<309> 1999-08-03 

<400> 10 

caattaatta tatcaaaatt tcgtatatca tttaaaattc atcactgatt tttgtttaga 60 

aaaaaagata gatagctatg gacatgacgt cgaattttaa tatatcctat gtaacagtgt 120 

tcatataatc aaaaaagaaa aaataattac tattgtttgt gttctttgca acaatgcgtt 180 

agaaactcgg agaccacttg ttttttctct tatgatttcg cttcagtatg atttagcaaa 240 

gcatctctac ggtacaaaat atcttacgaa ttgacatgat ctgatccaca ctatatcaag 300 

gagttagaaa gacagaactg aaagcttctt agcrtcgatat gtatttcaag ctacggttca 360 

caccatctag ggagacaaga atggaagaag gcatgacgtc aagtcgaact atgcagctct 420 

cttacttaaa tgggtcgaca aacaaatacg gaaccaacta tcatccattc gagcaagcgg 480 

tgataggtgg tatgactaag gtctctaaat ttggtttacg gcgaaaaatt aattgatcaa 540 

aaaatttttg gtttagtgtt ttaagatctt tttcaaaaaa acgttgtatt tgatgtataa 600 

acgctatttc tttttgaaat taatttaata ttatttttgt ttgaaaagaa gaagaaacat 660 

aattcaaaca atccctatct ttaatgttcc aacatcttaa taaatacata attaaaactc 720 

tacgatcaat actacacttc tgacgattat taaatcgcat cgtcgtacac tattattaca 780 

aagaccgttc tatctacatt aatattcttt atatattttt ttttttcatt ttttgagatt 840 

gttggagtat atatccttag attttcttgt atataaaaat agataattat aatgggataa 900 

acgaaacatc gttgaagccg tgaagtggca ttggtctaca ccagagcaac acaaaaagac 960 

aaccacttca catggtctgc tcttacattc ataaccgaat ctaagtcact tttagttggt 1020 

tgtaattttg taactattat tccaacacct ctttttttac tttttagatt gttttataat 1080 

agaaatattt taattcctaa attaataatg aaagtaaatg taatatgagt cagtacaata 1140 

tgtgaaaaac ttaaaaagtt gacagaattt agcatttgat taaaagtgta tgaagaagaa 1200 

gaaaaaagaa gatctttttg tatctataga tttagtgcat aacttttctc agattttcga 1260 

tatatacaag aatttaacat aagaaaaatc aagacaaatg gacctggtta taatcggtta 1320 

tctgttgtaa atattatatt tcatattctt ctccacttca taattcttat tggagttcct 1380 

tcaccaaatg tttgatgttc . cattaaatta atctaccact ctaataagag gtatcgtact 1440 

acaaattaca cttcataaac aagagaagaa cataaatttg aattttttta aaaaaacata 1500 

tgcgttataa caccaaacag taacggacta gctgatcctt gaatttatat tagttgcaaa 1560 

aatttatata tcgaaattga aacatgaatt tttaaaatta ttagaaaatg tatgatgttg 1620 

tctaaatgtg acattacaaa tacatgatgt tgtttaaaaa ttattataaa acaaccaaag 1680 

tttggcgtca ttctggtaaa cgcacttaat gaatttatat acagttaaag attttaacat 1740 

caaattttaa aatgacaact taactaaaat ttgtatccta atatttttac tagagaaact 1800 

cacacatatt tttcaaacaa atgattagtt atatatcctt cgataatagg tattgtgtaa 1860 

aactgtgtgg tttgcacaag tgatccctcg atattctttc tgctaaagat cgacttccca 1920 

cagtttcgat atctcgggtt tgggtgcaat agcatatgct tgtttagtat gcagataatc 1980 

gtatgagaga gtcagagaga tcatctattt ttcatgatat ggtctgagct aatcgatcat 2040 

ttgttattta tactcgataa cgttctaatt tgtatgattt ttcagccttg atctatcaca 2100 

aaatggagat aatgaaatgg tagtcataag aaaggtaatg atccccttga catgcttatt 2160 

ataacacaaa aagtaatcgc tactgattag cttaccctat gatttgaatc atataacttt 2220 

atactaattg tctaagtgtg atgatataat gtatatgatg tcatttataa cttattagta 2280 

aaaatataaa gttcttcatc attgtcataa ggagttaatt ggatatacat caaaaaattt 2340 

cctaaatttt tagaaattat taatcaagtt ataatagaaa ttattaaata aaatatatga 2400 

tgttttctaa atgtgacgat acaaaataca tgatgttgtt caaaacttat tagacagaaa 2460 

ccaaagtaca tcatcattgt gatgaagact ttaatggatt tatacatttc aaagttttaa 2520 

acattactta tttaaatgaa aaattgagct aaaatttgta tacttatatg ttgaccatag 2580 
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aaatacttct atataatttt agacaagttg tctatacata tatatatata tatatatata 2640 

tatatatatg tatttacaca aaaatcaagt tttcttaatg taacaatata aaaatataaa 2700 

atatctaatg ttgtttaaaa attatttaga ccaaaaaaaa aaaaaactgt aatcaaactt 2760 

tcattggatt atatctaact ctaaacatga atttttatac aatgacatgt tcaataaaat 2820 

tcccatttca tagtttgacc agaaaaaatg cttgcagatt tgttaaaatc tctacttatg 2880 

caaaaccatt gaatcacacc agattttgat ttgtagtact ttaaaaatat taattgaaca 2940 

caacataact ttaacaaaaa agatctttat aaaaaaaaaa ctttctaaaa tgacattaat 3000 

taacacaaat tagtcaattc aaacaatcca ataaaatcaa agtaaaccct taattgtttt 3060 

ttttttccat tttacaaaat gttaaacatt aattacatca actagcaaca cgaatctctc 3120 

aatctctcat gtctcaatga gtctttgacc actagtcttc atcactctct ctccctccta 3180 

tatataccaa tcacaccctt ctccaaccac actctcattc tctctctctt ctcaccactt 3240 

caaaacaaga cactcgtaaa agagttcttg aacttcacaa aacaattgtc agattttcaa 3300 

gaaaaacttt ataaaacaaa aaacatttca ttctttctct ctctctctct cactgctcaa 3360 

tgatctcgtt ttctcactaa accaactcgt ttcttcttac tttctttaac tcggatctac 3420 

aaaaaaccat gtcggaaact aacacaaccg atgccaagac tgttccagca gaagcagaga 3480 

agaagaaaga acagagttta ccattcttta aactcttttc tttcgctgat aaatttgatt 3540 

atctcttaat gttcgttggt tctcttggtg ccattgttca tggctcttcc atgcctgtct 3600 

tctttttact ctttggtcaa atggttaatg gatttggtaa aaaccaaatg gatttacatc 3660 

aaatggttca tgaagtctct agagtaagtc ttttttttct ttcttgtttt tatcaaatta 3720 

gatccaagat ctgatctaat ttttgtgttt gtgaaatttt gcagtattct ctatatttcg 3780 

tctacttggg tttggtcgtt tgcttctctt cttacgcagg tttgtttctt acaaatccaa 3840 

tcttcttttt ctttattctt caatcaagaa acccagtaat taaatcacat tataaacaag 3900 

atcaagaaat tttttgcaca aaaaaaaaag aaaacaagat caagaaatta ataaagttta 3960 

gattaaacat ttgcattatt ttcttttttt tggttcaacc aattaaccca ttaacataaa 4020 

ttaaatcaga ttattaggca agaaatctaa aggtgtattt tttgagtttt atttgattct 4080 

tcctcaaagt gaccatttac tctgaaggta aaacattttt ttttactcta caacaaaaac 4140 

ccaacttctt tttttgtttg cttcaaaatt atattaaaaa aagaaagctt aaactttgtt 4200 

taaaatcttg ttttttttgt cattcgcttt tgattagaac taaaaaaacc atttttatag 4260 

aatgttgttt acataagtga tgtaatgggg ttcggaacga actttccgtg ccaaccgctt 4320 

ttatagacaa gaaatctaaa ggtgtaattt ttgagttttt atttgattct tccacaaagt 4380 

gatcatttac tctggtaaac atttttttac tctacaacaa aaacccaact tttttatgtt 4440 

tgcttcaata aataaaaacc tagaaacaga aatcacaatc tagagagaat caaaattata 4500 

tataaaaaaa aaacacttca aatttttaaa aattttaata tgttaaaagg ataagccaag 4560 

tccacgtgat tcatggacta ctactttgtc tatatcgaaa aaaaaaaaaa aatgggcatc 4620 

tctctcacat ttattacaac ttcataaaaa ttcttgtaat aataaccata attttttgtt 4680 

aaataatttt acagagatag catgttggat gtattctgga gaaagacaag tagcagcatt 4740 

aaggaagaaa tatcttgaag cagtattaaa acaagacgtt gggttctttg atactgatgc 4800 

tagaactggt gacattgtct ttagtgtttc tactgatact cttcttgttc aagatgccat 4860 

tagtgaaaag gtaaccattt ttttttttac taccaacaaa atctagtatt taccggcggg 4920 

tcccattttt ttcatgatca atctttatca aagttttggg tccgtttatg ttagcaaccg 4980 

tcgtttatct tgcgcctcag gtggttaggg aagaaaaaag actatagatt gatttttttt 5G4£ 

tttgttaaaa taaaagttgg atttttttca taagattgtg aaaataagta attttttttt 5100 

acaaatttgc tatagaaata gaggattttt ttcactattt aactaaaact taaatgatat 5160 

agtactcatt ttcttacatt tttaatataa aaatagcttc tttttttcca ttgtgaaaaa 5220 

tatttattat gcaaaataat gtttctttaa caagtaaaat aaatatttgt tattaaaaaa 5280 

aatgtgtaaa gtaccttcaa attttaatag tacattttat ctaaaactgt tttttttttt 5340 

gcaatttggt cctgtctctg tggtgcacgt tagcaagcac gagctaacat gcactgggtc 5400 

ccctagtttc attattatca tcactctttt ttattgttga tgaagtgaat taataacaga 5460 

gtagttgaaa aaaaattgaa tttctcaggt tggaaacttt atacattacc tctcaacatt 5520 

tttggcggga ttagtagttg gatttgtatc agcatggaaa ttagctttgt taagtgttgc 5580 

tgtgattccc ggaatcgctt tcgccggagg tttatacgct tatacactca ccggaattac 5640 

ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc gccgagcagg taaataaaga 5700 

ttaaagtttg ttcctttatt tgttttattt aatgcccacc aactgttcga cgtaatgctt 5760 

tttaactgtt cttaggcttg cttctttgta cttgatctct caaatgattt tttttttttt 5820 

tcattgcttc ttatgcaatc caaagactta aacagtgttt tcactaattc aaaatctttg 5880 

cttttctttg gtcggttaat gattgatgtt ttggtttctc aggcaattgc tcaagttcga 5940 

actgtttatt cttatgttgg agagagtaag gcacttaatg cgtattcgga tgcgattcag 6000 

tatacgctta agctcggtta taaagcgggg atggctaaag ggttgggttt aggatgtact 6060 

tatggaatag cttgtatgtc atgggctttg gtgttttggt atgctggagt ttttattcgg 6120 

aatggacaaa ccgatggagg aaaggcgttt actgctatat tctctgctat tgttggtgga 6180 

atgtaagaat tcaaaacata acaatgatgt ggttttgtgt attttgtgtt ttgttattac 6240 

ttggatgtca ctttttgttt ctgtgtgtgt ttttaggagt ttggggcaat ctttctcgaa 6300 
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tcttggggcg tttagtaaag gtaaagcggc tggttataag ttgatggaga taattaacca 6360 

gagaccgacg ataattcaag acccgttgga tggaaaatgt ttggatcaag ttcatgggaa 6420 

cattgagttt aaagatgtga cttttagcta tccttcacgg cctgatgtta tgatcttcag 6480 

gaactttaat attttcttcc cttctgggaa aactgtggcg gttgttggtg ggagtggctc 6540 

tggaaagagt actgttgttt ccctcattga gagattctat gatccaaaca gcggtaattt 6600 

gattgaattt gttttttgct atttgagtct ttggctagtg actggatcat aactttgttt 6660 

attttttctt gatgagcagg gcaaattctg ttggatggtg -ttgagataaa gacgcttcag 6720 

ttgaagtttt tgcgtgaaca aatcgggctt gtgaatcaag aacctgcgct ctttgccact 6780 

actatactag agaacatact ctatggaaag cctgatgcaa caatggttga agttgaagct 6840 

gctgcttccg ctgcgaatgc gcatagtttc attacattac ttcctaaagg ctacgacaca 6900 

caggtataaa tcaaatattt gattatatgt ctagttaatg tcatggcctt ttgcttaatt 6960 

ttttgttgaa tgtcaataca attaggttgg agaacgtggt gttcaactct caggtggaca 7020 

gaagcagaga attgcaattg ctagggcgat gttgaaagac ccaaagattc tgttactaga 7080 

tgaagctaca agcgctcttg atgctagctc tgagagcatt gttcaggaag ctttagacag 7140 

agtcatggtg gggaggacca ctgttgttgt tgctcatcgt ctctgcacca tcagaaatgt 7200 

tgattccatt gccgtgatac agcaaggcca agttgttgaa accggaacac atgaagaact 7260 

cattgccaaa tccggtgctt acgcatccct catcaggttt caggaaatgg ttggtactcg 7320 

agatttctca aacccgtcaa ctcgtcgcac tcgttcaacc cgtttgagcc attcactgtc 7380 

aacgaaatca ctcagtttaa gatcaggaag tttgaggaat ctgagctatt cttacagcac 7440 

tggagctgat ggtcggatag agatgatttc aaatgcagag actgaccgaa agactcgtgc 7500 

ccctgaaaat tacttctaca ggcttctcaa gcttaattca ccggaatggc cttactcaat 7560 

catgggagca gtaggctcaa ttctttctgg tttcattggt cctacatttg ctattgtgat 7620 

gagcaacatg atcgaagtct tctactacac agactatgat tcaatggaaa iggaaaacaaa 7680 

agagtatgtc ttcatctaca ttggtgctgg tctctatgct gtgggtgctt atttgatcca 7740 

acattacttc tttagcatca tgggagaaaa cctcacaaca agagtaagaa gaatgatgct 7800 

ctcaggtatg tatcaaaatc tcctgaattt gcttaaaatc actttcccat ttcttatttt 7860 

ggtttcttga ttgttcttat ttagctatct tgagaaacga agttggttgg ttcgatgagg 7920 

atgaacacaa ctcaagcctg atcgctgcac gtttagctac tgatgcagca gatgttaaat 7980 

ccgctatagc cgagagaatc tcagtaattc tacaaaacat gacttcactt ctcacatcct 8040 

tcatagtcgc cttcatagta gaatggagag tctcacttct catcttaggc acattcccac 8100 

ttctagtcct cgctaacttt gctcaggtaa ataatctaat ctttttactc aaaatctttc 8160 

aatattcatc aatcattaaa atataattgg aatcttgcat tcaccattat gatcttaaga 8220 

aaaaacgaca aaaggccaga tttttataaa tttatatttg cttttcaaaa gttcaaaact 8280 

ttatagaaca tggcaacaca gccactgcct ctacacgtgc ttcatcttct aactttatcc 8340 

aagtttgcat ttatgtttac aataataatc ataaaagaat tattaagaag cttttttttt 8400 

ctactttttg gaaatagtgt aagggtcaag atcatggagc cctcacatca ataaatgtgc 8460 

taaaaaaatt aaaaaacagt aggctttagt ttactctctg ggcatgtgtg aagaatattt 8520 

attatatagt ttctattggt actatgaccc atagataaca gtgttcacga aaatagctaa 8580 

gattcctctg tcttttgctt ctgctaaatc tatcctactt taagctttca tatttacttc 8640 

actctctgaa ctctgaactg tgatcccact tctctcttta tttaattctt ttgcccataa 8700 

aacctcacca caaaaatcca aaaaatctgc aatttttttc cttcttagaa ccaatatttt 8760 

atttagagtt cttcattggt caaagttgtt gtctcagtgc attatttact tatcacagtg 8820 

tgtgtgtcag tgtttttaca ccatccacta gtcaatgttt gcttgtgggt ttctttgttt 8880 

tggtagatta ggttgtgatg agtttttttt ttgtttctaa ctagctgcaa ggttcaggac 8940 

tctgctttga tatatcacca acattttttc acccgtgatc taattattag ttgaaaaatc 9000 

tatcaaatag atttcacaca gaagaacata gataggcatt gtacttgtac tgatgttgat 9060 

gggatagagt gttgcatatg tgatttaact atatggttct acgtcatgtt ttagtggcga 9120 

cttagacctt tgattgtcaa tcttattttt tacaagtgaa ctattattac catctgttgt 9180 

tctaaatcat aagtattaat atatgtggct acattgcagc aactatctct gaagggtttt 9240 

gctggagaca cagctaaggc tcatgcaaag acttcaatga ttgctggtga aggagtcagt 9300 

aacattagaa ccgtagcagc tttcaatgca cagagcaaga ttctctcttt gttctgtcat 9360 

gagcttcgtg tacctcagaa aagaagctta taccgaagtc aaacctcggg tttcctattt 9420 

ggcctctcgc agcttgctct ctatggttct gaggctttaa ttctctggta tggtgcccac 9480 

cttgtgagta aaggcgtgtc aaccttttcc aaagtgatca aagtgtttgt ggttttggtc 9540 

attactgcaa actctgttgc tgaaactgtc agtcttgctc ctgaaattat tcggggaggt 9600 

gaagctgttg gttcggtttt ctcggtcttg gacaggcaga ccaggattga cccggatgat 9660 

gctgatgctg atcccgtgga gacgatccgt ggagacattg agtttaggca tgttgatttc 9720 

gcttaccctt caagacccga cgtcatggtt ttcagggact ttaacctcag aattcgagct 9780 

ggacatagcc aagctcttgt gggcgcgagt gggtcaggga agagttctgt aattgcgatg 9840 

atcgagcggt tttacgaccc tcttgctgga aaagtcatga ttgatggcaa agacatccgc 9900 

cggctaaacc tgaaatctct aaggctcaaa atcggtcttg ttcaacaaga accagctctc 9960 

ttcgcagcaa cgatcttcga caacatcgcc tatggtaaag atggtgcaac tgaatccgag 10020 
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gtaattgatg cagctcgagc cgcaaatgct cacggtttca tcagtggttt acctgaaggt 10080 

tacaaaactc cagtaggcga aagaggagtg cagttatcag gtggacagaa acagaggatc 10140 

gcgatagcaa gagctgtgct caagaaccct acagtgttgc ttctagacga agcaactagc 10200 

gcactagatg cagaatcaga atgcgtgctg caagaggcgt tagagaggct catgagaggt 10260 

cggaccaccg tggtagttgc tcaccgcttg tccaccataa gaggtgttga ttgcattggt 10320 

gtgattcaag acgggcggat tgtggagcaa ggcagccatt cagagctcgt tagccgacca 10380 

gagggagctt attcaaggct gttacagctt caaacacata ggatttgaag cttgatcatg 10440 

gattaaaaac aaaaaatcgg tttgtgtaat tttttttata ttaaaacttt aatttggaag 10500 

atttctatgg actataacga taatatgaat aggtgtagat aatgaagctt ttggagtgtt 10560 

tatgaaggtt ctttaattaa gggttatttt tttcgcattt tgcttatgtg cccgttttgg 10620 

aacataagac tgaactatgt tttcgttatg tttttaattt atgcctcgaa acaaaacaaa 10680 

atcactctaa tactttggtt gatcaaaatt tccctaaaac atttggttca tgactagaat 10740 

tgatcggaca tgttctttta ggggtttgat ctcgcatggt ctaatcaggt tatgtggtct 10800 

tacagcgact taatcaattt agaccggcta tcaaaattag attcagccta aacagcaaaa 10860 

ttgtcttata taatatccta aatcacagac cggatttttt ttttttataa actctatcgg 10920 

ctgatccggt caaccccggt agaaacgcac tgaagtgttt catagtggac ttcaaaccga 10980 

aattttgcag aaattgagat gagaatgagc tctacctcgc attgacatgt tatttcgttc 11040 

ttgagatctt aagtaatttt ttgagatctt aagtaatttt ttggtattca agttccttat 11100 

ttttttgcgg ttactcgtct tgacatgtta tttcgttctt gtgctactta cataccacaa 11160 

tatatgtagg tttctaaaca tatataatag gaatgttgtt ctatatatgt gttgtttgat 11220 

acgtaaagag taatctggtt ctacgtagag tctatctctt gtttcttggt gagattttga 11280 

ctggactgca tggtaatcct cgcgcatggt gtttagttac cggttgcttc attcatttgg 11340 

ccacgctcaa attttctaga agactcttcg gtctttggta caaaggcctc cgatttttcc 11400 

atgacaagtt cattcatatt tttttgtgac aacttgattg ttgttcattt ccgaatattc 11460 

catgaacgta tcaaactcaa taaaatcgat tgttgtcact ccctttaaga tgagatcctc 11520 

aacggcagct tcaccactag ccatgttttc acacatgatc aactatctca catttttact 11580 

aaggctttaa gcaggaatat tacttctaga attttaacaa tctagaattt ctagaggaaa 11640 

tgttcacatg tatatatttt ggtatatgat ctagagaact tatataagat tatcaatgtt 11700 

taatatactc aacaaatggg ttcttctatt ttatcttcta attgttgggg ccttgaatta 11760 

tttagttctt tttcactatt ccttttcttt tatttggtgt gtggagccat aaagttccaa 11820 

taatttctat gaaaagaaat gaataaaatc taaagaaacc aactgtgaag ctctttaatc 11880 

ttattctatc tgttggatct gtttgccttt atttgctttg catgtggttt tgctgtttaa 11940 

gataaaggcc atatcttctc ctaattcttt ttccaaatca acacatcaaa tatataaaag 12000 

actccaaata ttttgaatag aatcataatt tatttaatta caaaaaaact taaagaattc 12060 

aagacctgcg atcacaagac aaatacatgt aaggaattaa atatttaaga taccaaccgc 12120 

taataaataa gactcacaat ttctaaaaat taaaacgaaa aaaagaaaaa attagtcatc 12180 

attgcatatc ttagtgaaca aaaatcaata agaaaaggat aatatacttt ttaccaacat 12240 

cacataaacc taaaagcatc tcacaccaaa agtaaattaa aaactcagaa aaagctgaat 12300 

acaagttatg aattttatgg aaagtgttag tgtgttagtg ttgttatcat tacaacttta 12360 

aaataattat gaacatgaaa aaatagattg aacatgaagc agcattggga aggactcgaa 12420 

tgtccctact catactgaca aaacctcaag tgttc ca gtg cgttaggttt cactttccag 1248.0 

tagtatgtgt atctgctttt tttgtatata ttaatgtaat agatagtgta gagaatcatg 12540 

aggaacattt tgtaatcata gaactaaggc ataattttga tcgtgtgaaa taatactatt 12600 

atttttggta acaaggatga tctggaccga attccgacta attttccagg tgaatatagt 12660 

catcaagtaa cttgatcacc ggattttaaa aatgaattat gaacatgata tcatttcaac 12720 

taatcaaata aaaatgaaaa aattcatgtg ggaatcatat ccctcaaatc tgttaccacc 12780 

aaactacttt catttggttt gtgtgagaaa tataagaatt atcaaacaaa gaaaaaggtt 12840 

ttgcttaatt ataatgatat gaggttcatc aaatttatat accactaaaa tattcctaca 12900 

ctcatccaaa tattattctc gcttatgcag ttttaatcta atgtacaaat catatccata 12960 

agcgcattgg ccgatgctaa aattttggca ttagttgttg atttgttttt cttgagtcct 13020 

atgttcaaaa aatatatttg ataatttaat gtataaatta tacccataat cacattgatc 13080 

aatgcttaaa gttttgtatt agttgttgat tttttgtttg tttgcgtgtg ttgaattttc 13140 

ataaatgaaa agtaaatacg ataatcatat ttcaaaacgt aaaggtgtta attagtagtc 13200 

taaatggtaa aatataaaca gtatattata aaaattacta aaatggtttt gtaaaaaaaa 13260 

tatgctattt gtattataat gaaattcaaa aattttaaat agaacgtata atttctgcac 13320 

aaagaggttt tgaggtgtta taattcatga agtaaatttt atttactgac gggtaagttg 13380 

tgaaaaagtt tgaagactat tttttttgct tttcacagag aaaactactt tcctttttaa 13440 

ttgtatgatg agaaggcaaa agtgcagaca tgtgctttct tttctcccat tttcaacaat 13500 

gtcactcgtt gtattattca tattttagca aactggttat atctatatct atcaatcatt 13560 

tcagaacatc atatccatca gtttttggac attgctacat acgttagtat tgatgtacca 13620 

gttaccctaa caggcttttg catagtgtgg cagaacacgt gaggtgtgat atatgcggat 13680 

gaattctatg ttctgcattt tgttaccatt catataaaag tattgtttta gttgtgctgc 13740 
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ggtttaagtc ctaatacaga tttatttgta gtgttatgtt tataaaccac attggaaccc. 13800 

aaactctaag agaatataag atcaatgtgt aattaataaa ttttatagtc attcggataa 13860 

aacttagaaa gataacaaaa gtaagaacga gtatttttaa gcgaatactc tttagatatt 13920 

ctttgtattg atcattattg acaaaccatt aaacattttt gtagacccta aaatccgata 13980 

tccaaaaaac aattcttatt 14000 



<210> 11 
<211> 29 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 

cactgctcaa tgatctcgtt ttctcacta 29 

<210> 12 
<211> 30 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 12 

cttgaatcac accaatgcaa tcaacacctc 30 

<210> 13 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 13 

cattttataa taacgctgcg gacatctac 29 

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
<400> 14 

tttctccata ttgaccatca tactcattg 
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